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METHODS FOR mENTIFYING RISK OF OSTEOARTHRITIS AND 
TREATMENTS THEREOF 

Field of the Invention 

[0001] The invention relates to genetic methods for identifying risk of osteoarthritis and treatments 
that specifically target such diseases. 

Background 

[0002] Osteoarthritis (OA) is a chronic disease usually affecting weight-bearing synovial joints. 
There are approximately 20 million Americans affected by OA and it is the leading cause of disability in 
the United States. In addition to extensive human suffering, OA also accounts for nearly all knee 
replacements and more than half of all hip replacements in the United States. Despite its prevalence, OA 
is poorly imderstood and there are few treatments available besides anti-inflammatory drugs and joint 
replacement. 

[0003] Most commonly affecting middle-aged and older people, OA can range from very mild to 
very severe. It affects hands and weight-bearing joints such as knees, hips, feet and the back. Knee OA 
can be as disabling as any cardiovascular disease except stroke. 

[0004] OA IS characterized by the breakdown of cartilage in joints. Cartilage in joints cushions the 
ends of bones, and cartilage breakdown causes bones to rub against each other, causing pain and loss of 
movement. Type II collagen is the main component of cartilage, comprising 15-25% of the wet weight, 
approximately half the dry weighty and representing 90-95% of the total collagen content in the tissue. It 
forms fibrils that endow cartilage with tensile strength (Mayne, R. Arthritis Rhuem. 32:241-246 (1989)). 

Summary 

[0005] It has been discovered that certain polymorphic variations in human genomic DNA are 
associated with osteoarthritis. In particular, polymorphic variants in loci containing PSMBl, TBP, 
PDCD2, ELP3, CHDCl and ERG regions in htiman genomic DNA have been associated with risk of 
osteoarthritis. The PSMBl, TBP and PDCD2 regions are located in a leirger region referred to herein as 
the chrom 6 region. 

[0006] Thus, featured herein are methods for identifying a subject at risk of osteoarthritis and/or a 
risk of osteoarthritis in a subject, which comprise detecting the presence or absence of one or more 
polymorphic variations associated with osteoarthritis in or around the loci described herein in a human 
nucleic acid sample. In an embodiment, two or more poljmiorphic variations are detected in two or more 
regions, of which one or more is a PSMBl, TBP, PDCD2, ELP3. CHDCl or ERG region. In certain 
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embodiments, 3 or more, or 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more 
polymorphic variants are detected. 

[0007] Also featured are nucleic acids that include one or more polymorphic variations associated 
with occurrence of osteoarthritis, as well as polypeptides encoded by these nucleic acids. In addition, 
provided are methods for identifying candidate therapeutic molecules for treating osteoarthritis, as well 
as methods for treating osteoarthritis in a subject by identifying a subject at risk of osteoarthritis and 
treating the subject with a suitable prophylactic, treatment or therapeutic molecule. 

[0008] Also provided are compositions comprising a cell from a subject having osteoarthritis or at 
risk of osteoarthritis and/or a PSMBl, TBP, PDCD2, ELP3. CHDCl or ERG nucleic acid, with a RNAi, 
siRNA, antisense DNA or RNA, or ribozyme nucleic acid designed from a PSMBl, TBP, PDCD2, ELP3, 
CHDCl or ERG nucleotide sequence. In an embodiment, the RNAi, siRNA, antisense DNA or RNA, or 
ribozyme nucleic acid is designed from a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleotide 
sequence that includes one or more polymorphic variations associated with osteoarthritis, and in some 
instances, specifically interacts with such a nucleotide sequence. Further, provided are arrays of nucleic 
acids bound to a solid surface, in which one or more nucleic acid molecules of the array have a PSMBl, 
TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence, or a fragment or substantially identical 
nucleic acid thereof, or a complementary nucleic acid of the foregoing. Featured also are compositions 
comprising a cell from a subject having osteoarthritis or at risk of osteoarthritis and/or a PSMBl, TBP, 
PDCD2, ELP3. CHDCl or ERG polypeptide, with an antibody that specifically binds to the polypeptide. 
In an embodiment, the antibody specifically binds to an epitope in the polypeptide that includes a non- 
synonymous amino acid modification associated with osteoarthritis (e.g., results in an amino acid 
substitution in the encoded polypeptide associated with osteoarthritis). In certain embodiments, the 
antibody selectively binds to an epitope in the PSMBl, TBP, PDCD2. ELP3, CHDCl or ERG 
polypeptide having an amino acid associated with osteoarthritis. 

Brief Description of the Drawings 

[0009] Figures 1 A-ID show proximal SNPs in chrom 6, ELP3, CHDCl and ERG regions of 
genomic DNA, respectively. The position of each SNP in the chromosome is shown on the x-axis and 
the y-axis provides the negative logarithm of the p-value comparing the estimated allele to that of the 
control group. Also shown in the figures are exons and introns of the regions in the approximate 
chromosomal positions. 

Detailed D escription 

[0010] It has been discovered that a polymorphic variant in a locus containing a PSMBl, TBP, 
PDCD2, ELP3, CHDCl or ERG region is associated with occiurence of osteoarthritis in subjects. Thus, 
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detecting genetic determinants associated with an increased risk of osteoarthritis occurrence can lead to 
early identification of a predisposition to osteoarthritis and early prescription of preventative measures. 
Also, associating a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG polymorphic variant with 
osteoarthritis has provided new targets for screening molecules useful in treatments of osteoarthritis. 

Osteoarthritis and Sample Selection 

[0011] Osteoarthritis (OA), or degenerative joint disease, is one of the oldest and most common 
types of arthritis. It is characterized by the breakdown of the joint's cartilage. Cartilage is the part of the 
joint that cushions the ends of bones, and its breakdown causes bones to rub against each other, causing 
pain and loss of movement. Type II collagen is the main component of cartilage, comprising 15-25% of 
the wet weight, approximately half the dry weight, and representing 90-95% of the total collagen content 
in the tissue. It forms fibrils that endow cartilage with tensile strength (Mayne, R. Arthritis Rhuem. 
32:241-246(1989)). 

[0012] Most commonly affecting middle-aged and older people, OA can range fi-om very mild to 
very severe. It affects hands and weight-bearing joints such as knees, hips, feet and the back. Knee OA . 
can be as disabling as any cardiovascular disease except stroke. 

[0013] Osteoarthritis affects an estimated 20.7 million Americans, mostly after age 45, with women 
more commonly affected than men. Physicians make a diagnosis of OA based on a physical exam and 
history .of symptoms. X-rays are used to confirm diagnosis. Most people over 60 reflect the disease on 
X-ray, and about one-third have actual symptoms. 

[0014] There are many factors that can cause OA. Obesity may lead to osteoarthritis of the knees. 
In addition, people with joint injuries due to sports, work-related activity or accidents may be at increased 
risk of developing OA. 

[0015] Genetics has a role in the development of OA. Some people may be bom with defective 
cartilage or with slight defects in the way that joints fit together. As a person ages, these defects may 
cause early cartilage breakdown in the joint or the inability to repair damaged or deteriorated cartilage in 
the joint. 

[0016] Inclusion or exclusion of samples for an osteoarthritis pool may be based upon the following 
criteria: ethnicity (e.g., samples derived fi-om an individual characterized as Caucasian); parental 
ethnicity (e.g., samples derived from an individual of British paternal and maternal descent); relevant 
phenotype information for the individual (e.g., case samples derived from individuals diagnosed with 
specific knee osteoarthritis (OA) and were recruited from an OA knee replacement clinic). Control 
samples may be selected based on relevant phenotype information for the individual (e.g., derived from 
individuals free of OA at several sites (knee, hand, hip etc)); and no family history of OA and/or 
rheumatoid arthritis. Additional phenotype information collected for both cases and controls may include 
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age of the individual, gender, family history of OA, diagnosis with osteoarthritis (joint location of OA, 
date of primary diagnosis, age of individual as of primary diagnosis), knee history (current symptoms, 
any major knee injury, menisectomy, knee replacement surgery, age of surgery), HRT history, 
osteoporosis diagnosis. 

[0017] Based in part upon selection criteria set forth above, individuals having osteoarthritis can be 
selected for genetic studies. Also, individuals having no history of osteoarthritis often are selected for 

genetic studies, as described hereafter. 

Polymorphic Variants Associated with Osteoarthritig 

[0018] A genetic analysis provided herein linked osteoarthritis with polymorphic variant nucleic 
acid sequences in the human genome. As used herein, the term "polymorphic site" refers to a region in a 
nucleic acid at which two or more alternative nucleotide sequences are observed in a significant number 
of nucleic acid samples from a population of individuals. A polymorphic site may be a nucleotide 
sequence of two or more nucleotides, an inserted nucleotide or nucleotide sequence, a deleted nucleotide 
or nucleotide sequence, or a microsatellite, for example. A polymorphic site that is two or more 
nucleotides in length may be 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15 or more, 20 or more, 30 or more, 50 
or more, 75 or more, 100 or more, 500 or more, or about 1000 nucleotides in length, where all or some of 
the nucleotide sequences differ within the region. A polymorphic site is often one nucleotide in length, 
which is referred to herein as a "single nucleotide polymorphism" or a "SNP." 

[0019] Where there are two, three, or foiu- alternative nucleotide sequences at a polymorphic site, 
each nucleotide sequence is referred to as a "polymorphic variant" or "nucleic acid variant." Where two 
polymorphic variants exist, for example, the polymorphic variant represented in a minority of samples 
from a population is sometimes referred to as a "minor allele" and the poljonorphic variant that is more 
prevalently represented is sometimes referred to as a "major allele." Many organisms possess a copy of 
each chromosome (e.g., humans), and those individuals who possess two major alleles or two minor 
alleles are often referred to as being "homozygous" with respect to the polymorphism, and those 
individuals who possess one major allele and one minor allele are normally referred to as being 
"heterozygous" with respect to the polymorphism. Individuals who are homozygous with respect to one 
allele are sometimes predisposed to a different phenotype as compared to individuals who are 
heterozygous or homozygous with respect to another allele. 

[0020] In genetic analysis that associate polymorphic variants with osteoarthritis, samples from 
individuals having osteoarthritis and individuals not having osteoarthritis often are allelotyped and/or 
genotjrped. The term "allelotype" as used herein refers to a process for determining the allele frequency 
for a polymorphic variant in pooled DNA samples from cases and controls. By pooling DNA from each 
group, an allele frequency for each SNP in each group is calculated. These allele frequencies are then 
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compared to one another. The term "genotyped" as used herein refers to a process for determining a 
genotype of one or more individuals, where a "genotype" is a representation of one or more polymorphic 

variants in a population. 

[0021] A genotype or polymorphic variant may be expressed in terms of a "haplotype," which as 
used herein refers to two or more polymorphic variants occurring within genomic DNA in a group of 
individuals within a population. For example, two SNPs may exist within a gene where each SNP 
position includes a cytosine variation and an adenine variation. Certain individuals in a population may 
carry one allele (heterozygous) or two alleles (homo2ygous) having the gene with a cytosine at each SNP 
position. As the two cytosines corresponding to each SNP in the gene travel together on one or both 
alleles in these individuals, the individuals can be characterized as having a cytosine/cytosine haplotype 
with respect to the two SNPs in the gene. 

[0022] As used herein, the term "phenotype" refers to a trait which can be compared between 
individuals, such as presence or absence of a condition, a visually observable difference in appearance 
between individuals, metabolic variations, physiological variations, variations in the function of 
biological molecules, and the like. An example of a phenotype is occurrence of osteoarthritis. 

[0023] Researchers sometimes report a polymorphic variant in a database without determining 
whether the variant is represented in a significant Section of a population. Because a subset of these 
reported polymorphic variants are not represented in a statistically significant portion of the population, 
some of them are sequencing errors and/or not biologically relevant. Thus, it is often not known whether 
a reported polymorphic variant is statistically significant or biologically relevant until the presence of the 
variant is detected in a population of individuals and the fi^uency of the variant is determined. Methods 
for detecting a polymorphic variant in a population are described herein, specifically in Example 2. A 
polymorphic variant is statistically significant and often biologically relevant if it is represented in 5% or 
more of a population, sometimes 10% or more, 15% or more, or 20% or more of a population, and often 
25% or more, 30% or more, 35% or more, 40% or more, 45% or more, or 50% or more of a population. 

[0024] A polymorphic variant may be detected on either or both strands of a double-stranded nucleic 
acid. Also, a polymorphic variant may be located within an intron or exon of a gene or within a portion 
of a regulatory region such as a promoter, a 5 ' untranslated region (UTR), a 3 ' UTR, and in DNA (e.g. , 
genomic DNA (gDNA) and complementary DNA (cDNA)), RNA (e.g., mRNA, tRNA, and rRNA), or a 
polypeptide. Polymorphic variations may or may not result in detectable differences in gene expression, 
polypeptide structvire, or polypeptide fianction. 

[0025] It was determined that polymorphic variations associated with an increased risk of 
osteoarthritis existed in IhePSMBl, TBP. PDCD2, ELP3, CHDCl ov ERG regions. In certain 
embodiments, polymorphic variants at positions rs756519, rs 1042327, rs8770, rs 1563055, rs9 12428 and 
rsl 888475 in the human genome were associated with an increased risk of osteoarthritis, and in specific 
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embodiments, a thymine at position rs756519, a cytosine at position rs 1042327, a cytosine at position 
rs8770, a thymine at position rsl563055, a thymine at position rs912428 and an adenine at position 
rsl 888475 were associated with an increased risk of osteoarthritis. 

[0026] Polymorphic variants in and around the chrom 6 region were tested for association with 
osteoarthritis. These include polymorphic variants at positions in SEQ ID NO: 1 selected from the group 
consisting of 229, 6310, 11840, 11870, 12064, 13392, 16354, 16559, 16935, 17616, 17737, 18321, 
18453, 18811, 20020, 21662, 23197, 23446, 24339, 25504, 27174, 28008, 29294, 29759, 30832, 44512, 
44850, 45884, 46345, 48589, 53371, 53911, 53990, 55152, 55667, 58952, 59315, 60029, 61477, 62988, 
63090, 64021, 65685, 70220, 70323, 70959, 73436, 82945, 82958, 82961, 82964, 82965, 83006, 83025, 
83034, 83074 ,83132, 83155. 83172, 83174, 83206, 83216, 83234, 83252, 83260, 83263, 83296, 83319, 
83322, 83324, 83357, 83375, 83381, 83389, 83443, 83499, 83545, 83566, 83591, 83619, 83698, 83780, 
83784, 83826, 83832, 83852, 86297, 86315, 86420, 86460, 86714, 86718, 86736, 86753, 86766, 88162, 
88218, 88246, 88255, 88309, 88310, 88471, 88619, 88904, 89044, 90531, 90534, 90613 and 46252. 
Polymorphic variants at the following positions in SEQ ID NO: 1 in particular were associated with an 
increased risk of osteoarthritis: 229, 6310, 16559, 18453, 25504, 27174, 30832, 44850, 45884, 48589, 
61477, 82961 and 46252, with specific embodiments directed to variants at positions 229, 16559, 44850 
and/or 46252. In particular, the following polymorphic variants in SEQ ID NO: 1 were associated with 
risk of osteoarthritis: a thymine at position 229, a guanine at position 6310, a thymine at position 16559, 
an adenine at position 18453, an adenine at position 25504, an adenine at position 27174, an adenine at 
position 30832, a guanine at position 44850, an adenine at position 45884, an adenine at position 48589, 
a cytosine at position 61477, a cytosine at position 82961 and a thymine at position 46252. 

[0027] Polymorphic variants in and around the ELP3 region were tested for association with 
osteoarthritis. These include polymorphic variants at positions in SEQ ID NO: 2 selected fi-om the group 
consisting of 2 11, 473, 1536, 5639, 17186, 17335,25029, 25111,28811,28863, 30809,40985, 45147, 
45282, 46168, 46328, 49077, 51925, 52141, 52168, 60852, 62468, 65572, 79089, 79541, 79790, 90843, 
90978, 91052, 91 131, 91 132, 94439 and 94621. Polymorphic variants at the following positions in SEQ 
ED NO: 2 in particular were associated with an increased risk of osteoarthritis: 40985, 46168, 5 1925 and 
52168. In particular, the following polymorphic variants in SEQ ED NO: 2 were associated with risk of 
osteoarthritis: a cytosine at position 40985, a guanine at position 46168, a thymine at position 51925 and 
a cytosine at position 52168. 

[0028] Polymorphic variants in and around the CHDCl region were tested for association with 
osteoarthritis. These include polymorphic variants at positions in SEQ ID NO: 3 selected from the group 
consisting of 243, 10208, 15049, 15111, 15272, 15287, 15326, 15327, 17038. 19391,21702, 22431, 
22881, 27744, 32564, 32698, 33104, 33181, 33256, 33543, 35567, 40085, 40482, 45641, 46059, 48504, 
48919, 49693, 49874, 50020, 50616, 50719, 55511, 65533, 70529, 75591, 77266, 80368, 82475, 92462, 
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92480, 95819 and 96275. Polymorphic variants at the following positions in SEQ ID NO: 3 in particular 
were associated with an increased risk of osteoarthritis: 15111, 45641, 46059, 49693, 49874, 50020, 
50719, 70529, 82475, 92462, 92480 and 96275, with specific embodiments directed to variants at 
positions 82475 and/or 92462. In particular, the following polymorphic variants in SEQ ID NO: 3 were 
associated with risk of osteoarthritis: a guanine at position 1 5 1 1 1 , a thymine at position 45641 , an 
adenine at position 46059, a cytosine at position 49693, an adenine at position 49874, an adenine at 
position 50020, a guanine at position 50719, an adenine at position 70529, an adenine at position 82475, 
a thymine at position 92462, a thymine at position 92480 and a cytosine at position 96275. 

[0029] Polymorphic variants in and around the ERG region were tested for association with 
osteoarthritis. These include polymorphic variants at positions in SEQ ID NO: 4 selected from the group 
consisting of 231, 882, 960, 1194, 1530, 1673, 2096, 2285, 5873, 7256, 7988, 8222, 8381, 8814, 8915, 
9642, 9902, 10619, 10927, 11032, 14377, 15608, 15928, 16296, 17598, 19272, 20084, 20577, 28051, 
29466, 29530, 29987, 30012, 30322, 32216, 32516, 32544, 32746, 33137, 33538, 33798, 33802, 33964, 
34132, 34210, 34317, 34499, 34753, 34845, 35335, 36423, 36450, 36481, 38447, 38784, 39387, 39458, 
39822, 40305, 40869, 40926, 41010, 41 134, 41984, 42172, 42753, 4301 1, 43176, 43320, 43381, 44142, 
44383, 44726, 45087, 45141, 45359, 45421, 45456, 45467, 45486, 45709, 45716, 47626, 49413, 49796, 
49962, 50075, 50093, 50571, 50615, 50780, 50851, 51459, 53193, 53702, 53736, 53795, 54109, 54126, 
54230, 54894. 55455, 55499, 56522. 56662, 56954, 57267, 58282, 58916, 59544, 59666, 59913, 66846, 
67245, 67652, 67955, 67966, 68420. 70226, 70810. 72246, 73330, 73457, 74389, 74638, 74640, 75358, 
75952, 76098, 77836, 78449, 78507, 80031, 81695. 82775, 82795, 84611, 84657, 84693, 85020, 85048, 
85100, 85325, 85452. 85868, 85936. 85990. 86139. 86497, 87236, 87248, 87533, 87912, 88108, 88494, 
89598, 90235, 91287. 91359, 92384, 92410, 92900, 94495, 94512, 97777 and 98333. Polymorphic 
varizints at the following positions in SEQ ID NO: 4 in particular were associated with an increased risk 
of osteoarthritis: 1673, 20577, 33137, 39822, 45716, 49962, 51459, 54894, 55455, 55499, 58282, 68420 
and 80031, with specific embodiments directed to variants at positions 33137, 55499 and/or 58282. In 
particular, the following polymorphic variants in SEQ ID NO: 4 were associated with risk of 
osteoarthritis: a guanine at position 1673, a thymine at position 20577, a guanine at position 33137, a 
guanine at position 39822, an adenine at position 45716. a guanine at position 49962, an adenine at 
position 51459, a cytosine at position 54894, an adenine at position 55455, an adenine at position 55499, 
a guanine at position 58282, an adenine at position 68420 and a thymine at position 80031 . 

[0030] Based in part upon analyses summarized in Figures 1 A-ID, regions with significant 
association have been identified in loci associated with osteoarthritis. Any polymorphic variants 
associated with osteoarthritis in a region of significant association can be utilized for embodiments 
described herein. For example, polymorphic variants in a region spanning chromosome positions 
170719500 to 170766500 (approximately 47,000 nucleotides in length) in a chrom 6 region, spanning 
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chromosome positions 27963000 to 27983000 (approximately 20,000 nucleotides in length) in a ELP3 
region, spanning chromosome positions 44962000 to 45013000 (approximately 51,000 nucleotides in 
length) in a CHDCl region, and spanning chromosome positions 38830000 to 38844000 (approximately 
14,000 nucleotides in length) in a ERG region, have significant association (chromosome positions are 
within NCBI's Genome build 34). 

Additional Polymorphic Variants Associated with Osteoarthritis 

[0031] Also provided is a method for identifying polymorphic variants proximal to an incident, 
founder polymorphic variant associated with osteoarthritis. Thus, featured herein are methods for 
identifying a polymorphic variation associated with osteoarthritis that is proximal to an incident 
polymorphic variation associated with osteoarthritis, which comprises identifying a polymorphic variant 
proximal to the incident polymorphic variant associated with osteoarthritis, where the incident 
polymorphic variant is in a PSMBl. TBP, PDCD2. ELP3, CHDCl or ERG nucleotide sequence. The 
nucleotide sequence often comprises a polynucleotide sequence selected from the group consisting of (a) 
a polynucleotide sequence of SEQ ID NO: 1-12; (b) a polynucleotide sequence that encodes a 
polypeptide having an amino acid sequence encoded by a polynucleotide sequence of SEQ ID NO: 1-12; 
and (c) a polynucleotide sequence that encodes a polypeptide having an amino acid sequence that is 90% 
or more identical to an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1 or a 
polynucleotide sequence 90% or more identical to the polynucleotide sequence of SEQ ID NO: 1-12. 
The presence or absence of £in association of the proximal polymorphic variant with osteoarthritis then is 
determined using a known association method, such as a method described in the Examples hereafter. In 
an embodiment, the incident polymorphic variant is a polymorphic variant associated with osteoarthritis 
described herein. In another embodiment, the proximal polymorphic variant identified sometimes is a 
publicly disclosed polymorphic variant, which for example, sometimes is published in a publicly 
available database. In other embodiments, the polymorphic variant identified is not publicly disclosed 
and is discovered using a known method, including, but not limited to, sequencing a region surroimding 
the incident polymorphic variant in a group of nucleic samples. Thus, multiple polymorphic variants 
proximal to an incident polymorphic variant are associated with osteoarthritis using this method. 

[0032] The proximal polymorphic variant often is identified in a region surrounding the incident 
polymorphic variant. In certain embodiments, this surrounding region is about 50 kb flanking the fu-st 
polymorphic variant {e.g. about 50 kb 5' of the first polymorphic variant and about 50 kb 3' of the first 
polymorphic variant), and the region sometimes is composed of shorter flanking sequences, such as 
flanking sequences of about 40 kb, about 30 kb, about 25 kb, about 20 kb, about 15 kb, about 10 kb, 
about 7 kb, about 5 kb, or about 2 kb 5' and 3' of the incident polymorphic variant. In other 
embodiments, the region is composed of longer flanking sequences, such as flanking sequences of about 
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55 kb, about 60 kb, about 65 kb, about 70 kb, about 75 kb, about 80 kb, about 85 kb, about 90 kb, about 
95 kb, or about 100 kb 5' and 3' of the incident polymorphic variant. 

[0033] hi certain embodiments, polymorphic variants associated with osteoarthritis are identified 
iteratively. For example, a first proximal polymorphic variant is associated with osteoarthritis using the 
methods described above and then another polymorphic variant proximal to the first proximal 
polymorphic variant is identified (e.g., publicly disclosed or discovered) and the presence or absence of 
an association of one or more other polymorphic variants proximal to the first proximal polymorphic 
variant with osteoarthritis is determined. 

[0034] The methods described herein are useful for identifying or discovering additional 
polymorphic variants that may be used to further characterize a gene, region or loci associated with a 
condition, a disease (e.g. , osteoarthritis), or a disorder. For example, allelotyping or genotyping data 
from the additional polymorphic variants may be used to identify a functional mutation or a region of 
linkage disequilibrium. In certain ernbodiments, polymorphic veiriants identified or discovered within a 
region comprising the first polymorphic variant associated with osteoarthritis are genotyped using the 
genetic methods and sample selection techniques described herein, and it can be determined whether 
those polymorphic variants are in linkage disequilibrium with the first polymorphic variant. The size of 
the region in linkage disequilibrium with the first polymorphic variant also can be assessed using these 
genotyping methods. Thus, provided herein are methods for determining whether a polymorphic variant 
is in linkage disequilibrium with a first polymorphic variant associated with osteoarthritis, and such 
information can be used in prognosis/diagnosis methods described herein. 

Isolated Nucleic Acids 

[0035] Featured herein are isolated PSMBl, TBP, PDCD2. ELP3, CHDCl or ERG nucleic acid 
variants depicted in SEQ ED NO: 1-12, and substantially identical nucleic acids thereof. A nucleic acid 
variant may be represented on one or both strands in a double-stranded nucleic acid or on one 
chromosomal complement (heterozygous) or both chromosomal complements (homozygous). 

[0036] As used herein, the term "nucleic acid" includes DNA molecules (e.g., a complementary 
DNA (cDNA) and genomic DNA (gDNA)) and RNA molecules (e.g. , mRNA, rRNA, siRNA and tRNA) 
and analogs of DNA or RNA, for example, by use of nucleotide analogs. The nucleic acid molecule can 
be single-stranded and it is often double-stranded. The term "isolated or purified nucleic acid" refers to 
nucleic acids that are separated from other nucleic acids present in the natural source of the nucleic acid. 
For example, with regard to genomic DNA, the term "isolated" includes nucleic acids which are 
separated fi-om the chromosome with which the genomic DNA is naturally associated. An "isolated" 
nucleic acid is often free of sequences which naturally flank the nucleic acid (i.e., sequences located at 
the 5' and/or 3' ends of the nucleic acid) in the genomic DNA of the organism fi-om which the nucleic 
9 
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acid is derived. For example, in various embodiments, the isolated nucleic acid molecule can contain less 
than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 5' and/or 3' nucleotide sequences which flank 
the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular 
material, or culture medium when produced by recombinant techniques, or substantially free of chemical 
precursors or other chemicals when chemically synthesized. As used herein, the term "gene" refers to a 
nucleotide sequence that encodes a polypeptide. 

[0037] Also included herein are nucleic acid fragments. These fragments often have a nucleotide 
sequence identical to a nucleotide sequence of SEQ ED NO: 1-12, a nucleotide sequence substantially 
identical to a nucleotide sequence of SEQ ID NO: 1 -1 2, or a nucleotide sequence that is complementary 
to the foregoing. The nucleic acid fragment may be identical, substantially identical or homologous to a 
nucleotide sequence in an exon or an intron in a nucleotide sequence of SEQ ID NO: 1-12, and may 
encode a domain or part of a domain of a polypeptide. Sometimes, the fragment will comprises one or 
more of the polymorphic variations described herein as being associated with osteoarthritis. The nucleic 
acid fragment is often 50, 100, or 200 or fewer base pairs in length, and is sometimes about 300, 400, 500, 
600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 2000, 3000, 4000, 5000, 10000, 15000, or 20000 
base pairs in length. A nucleic acid fragment that is complementary to a nucleotide sequence identical or 
substantially identical to a nucleotide sequence in SEQ ID NO: 1-12 and hybridizes to such a nucleotide 
sequence under stringent conditions is often referred to as a "probe." Nucleic acid fragments often include 
one or more polymorphic sites, or sometimes have an end that is adjacent to a polymorphic site as described 
hereafter. 

[0038] An example of a nucleic acid fragment is an oligonucleotide. As used herein, the term 
"oUgonucleotide" refers to a nucleic acid comprising about 8 to about 50 covalently linked nucleotides, 
often comprising from about 8 to about 35 nucleotides, and more often from about 10 to about 25 
nucleotides. The backbone and nucleotides within an oligonucleotide may be the same as those of 
naturally occurring nucleic acids, or analogs or derivatives of naturally occurring nucleic acids, provided 
that oligonucleotides having such analogs or derivatives retain the ability to hybridize specifically to a 
nucleic acid comprising a targeted polymorphism. Oligonucleotides described herein may be used as 
hybridization probes or as components of prognostic or diagnostic assays, for example, as described 
herein. 

[0039] Oligonucleotides are typically sjoithesized using standard methods and equipment, such as 
the ABI™3900 High Throughput DNA Synthesizer and the EXPEDITE™ 8909 Nucleic Acid 
Synthesizer, both of which are available frt>m Applied Biosystems (Foster City, CA). Analogs and 
derivatives are exemplified in U.S. Pat. Nos. 4,469,863; 5,536,821; 5,541,306; 5,637,683; 5,637,684; 
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5,700,922; 5,717,083; 5,719,262; 5,739,308; 5,773,601; 5,886,165; 5,929,226; 5,911,296; 6,140,482; 
WO 00/56746; WO 01/14398, and related publications. Methods for synthesizing oligonucleotides 
comprising such analogs or derivatives are disclosed, for example, in the patent publications cited above 
and in U.S. Pat. Nos. 5,614,622; 5,739,314; 5,955,599; 5,962,674; 6,1 17,992; in WO 00/75372; and in 
related publications. 

[0040] Oligonucleotides may also be linked to a second moiety. The second moiety may be an 
additional nucleotide sequence such as a tail sequence (e.g. , a polyadenosine tail), an adapter sequence 
(e.g., phage Ml 3 universal tail sequence), and others. Alternatively, the second moiety may be a non- 
nucleotide moiety such as a moiety which facilitates linkage to a solid support or a label to facilitate 
detection of the oligonucleotide. Such labels include, without limitation, a radioactive label, a 
fluorescent label, a chemiluminescent label, a paramagnetic label, and the like. The second moiety may 
be attached to any position of the oligonucleotide, provided the oligonucleotide can hybridize to the 
nucleic acid comprising the polymorphism. 

Uses for Nucleic Acid Sequence 

[0041] Nucleic acid coding sequences may be used for diagnostic purposes for detection and control 
of polypeptide expression. Also, included herein are oligonucleotide sequences such as antisense RNA, 
small-interfering RNA (siRNA) and DNA molecules and ribozymes that function to inhibit translation of 
a polypeptide. Antisense techniques and RNA interference techniques are known in the art and are 
described herein. 

[0042] Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
hammerhead motif ribozyme molecules may be engineered that specifically and efficiently catalyze 
endonucleolytic cleavage of RNA sequences corresponding to or complementary to a PSMBl, TBP, 
PDCD2, ELP3, CHDCl or ERG nucleotide sequence. Specific ribozyme cleavage sites within any 
potential RNA target are initially identified by scarming the target molecule for ribozjmie cleavage sites 
which include the following sequences, GUA, GUU and GUC. Once identified, short RNA sequences of 
between fifteen (15) and twenty (20) ribonucleotides corresponding to the region of the target gene 
containing the cleavage site may be evaluated for predicted structural features such as secondary 
structure that may render the oUgonucleotide sequence unsuitable. The suitability of candidate targets 
may also be evaluated by testing their accessibility to hybridization with complementary 
oligonucleotides, using ribonuclease protection assays. 

[0043] Antisense RNA and DNA molecules, siRNA and ribozymes may be prepared by any method 
known in the art for the synthesis of RNA molecules. These include techniques for chemically 
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synthesizing oligodeoxyribonucleotides well known in the art such as solid phase phosphoramidite 
chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo transcription 
of DNA sequences encoding the antisense RNA molecule. Such DNA sequences may be incorporated 
into a wide variety of vectors which incorporate suitable RNA polymerase promoters such as the T7 or 
SP6 polymerase promoters. Alternatively, antisense cDNA constructs that synthesize antisense RNA 
constitutively or inducibly, depending on the promoter used, can be introduced stably into cell lines. 

[0044] DNA encoding a polypeptide also may have a number of uses for the diagnosis of diseases, 
including osteoarthritis, resulting from aberrant expression of a target gene described herein. For 
example, the nucleic acid sequence may be used in hybridization assays of biopsies or autopsies to 
diagnose abnormalities of expression or function (e.g.. Southern or Northern blot analysis, in situ 
hybridization assays). 

[0045] In addition, the expression of a polypeptide during embryonic development may also be 
determined using nucleic acid encoding the polypeptide. As addressed, infra, production of functionally 
impaired polypeptide is the cause of various disease states, such as osteoarthritis. In situ hybridizations 
using polypeptide as a probe may be employed to predict problems related to osteoarthritis. Further, as 
indicated, infra, administration of human active polypeptide, recombinantly produced as described 
herein, may be used to treat disease states related to fimctionally impaired polypeptide. Alternatively, 
gene therapy approaches may be employed to remedy deficiencies of functional polypeptide or to replace 
or compete with dysfunctional polypeptide. 

Expression Vectors. H ost Cells, and Genetically Engineered Cells 

[0046] Provided herein are nucleic acid vectors, often expression vectors, which contain a PSMBl, 
TBP, PDCD2, ELP3. CHDCl or ERG nucleotide sequence, or a substantially identical sequence thereof. 
As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked and can include a plasmid, cosmid, or viral vector. The vector 
can be capable of autonomous replication or it can integrate into a host DNA. Viral vectors may include 
replication defective retroviruses, adenoviruses and adeno-associated viruses for example. 

[0047] A vector can include a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence 
in a form suitable for expression of an encoded target polypeptide or target nucleic acid in a host cell. A 
"target polypeptide" is a polypeptide encoded by a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG 
nucleotide sequence, or a substantially identical nucleotide sequence thereof. The recombinant 
expression vector typically includes one or more regulatory sequences operatively linked to the nucleic 
acid sequence to be expressed. The term "regulatory sequence" includes promoters, enhancers and other 
expression control elements {e.g., polyadenylation signals). Regulatory sequences include those that 
direct constitutive expression of a nucleotide sequence, as well as tissue-specific regulatory and/or 
12 
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inducible sequences. The design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, the level of expression of polypeptide desired, and the like. Expression 
vectors can be introduced into host cells to produce target polypeptides, including fusion polypeptides. 

[0048] Recombinant expression vectors can be designed for expression of target polypeptides in 
prokaryotic or eukaryotic cells. For example, target polypeptides can be expressed in E. coli, insect cells 
{e.g., using baculo virus expression vectors), yeast cells, or mammalian cells. Suitable host cells are 
discussed further in Gbeddel, Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, CA (1 990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

[0049] Expression of polypeptides in prokaryotes is most often carried out in E. coli with vectors 
containing constitutive or inducible promoters directing the expression of either fusion or non-fusion 
polypeptides. Fusion vectors add a number of amino acids to a polypeptide encoded therein, usually to 
the amino terminus of the recombinant polypeptide. Such fusion vectors typically serve three purposes: 
1) to increase expression of recombinant polypeptide; 2) to increase the solubility of the recombinant 
polj^eptide; and 3) to aid in the purification of the recombinant polypeptide by acting as a ligand in 
affinity purification. Often, a proteolytic cleavage site is introduced at the jxmction of the fusion moiety 
and the recombinant polypeptide to enable separation of the recombinant polypeptide from the fiision 
moiety subsequent to pvirification of the fusion polypeptide. Such enzymes, and their cognate 
recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors 
include pGEX (Pharmacia Biotech Inc; Smith & Johnson, Gene 67: 31-40 (1988)), pMAL (New England 
Biolabs, Beverly, MA) and pRTTS (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase 
(GST), maltose E binding polypeptide, or polypeptide A, respectively, to the target recombinant 
polypeptide. 

[0050] Purified fusion polypeptides can be used in screening assays and to generate antibodies 
specific for target polypeptides. In a therapeutic embodiment, fusion polypeptide expressed in a 
retroviral expression vector is used to infect bone marrow cells that are subsequently transplanted into 
irradiated recipients. The pathology of the subject recipient is then examined after sufficient time has 
passed ie.g., six (6) weeks). 

[0051] Expressing the polypeptide in host bacteria with an impaired capacity to proteolytically 
cleave the recombinant polypeptide is often used to maximize recombinant polypeptide expression 
(Gottesman, S., Gene Expression Technology: Methods in Enzymology, Academic Press, San Diego, 
California 185: 1 19-128 (1990)). Another strategy is to alter the nucleotide sequence of the nucleic acid 
to be inserted into an expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et at.. Nucleic Acids Res. 20." 21 1 1-21 1 8 (1992)). Such aheration 
of nucleotide sequences can be carried out by standard DNA synthesis techniques. 
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[0052] When used in mammalian cells, the expression vector's control functions are often provided 
by viral regulatory elements. For example, commonly used promoters are derived from polyoma. 
Adenovirus 2, cytomegalovirus and Simian Virus 40. Recombinant mammalian expression vectors are 
often capable of directing expression of the nucleic acid in a particular cell type ie.g., tissue-specific 
regulatory elements are used to express the nucleic acid). Non-limiting examples of suitable tissue- 
specific promoters include an albumin promoter (liver-specific; Pinkert et al., Genes Dev. 1: 268-277 
(1987)), lymphoid-specific promoters (Calame & Eaton, Adv. Immunol. 43: 235-275 (1988)), promoters 
of T cell receptors (Winoto & Baltimore, EMBOJ. 8: 729-733 (1989)) promoters of immunoglobulins 
(Banerji et al., Cell 33: 729-740 (1 983); Queen & Baltimore, Cell 33: 741-748 (1983)), neuron-specific 
promoters (e.g., the neurofilament promoter; Byrne & Ruddle, Proc. Natl. Acad. Set. USA 86: 5473- 
5477 (1989)), pancreas-specific promoters (Edlund et al.. Science 230: 912-916 (1985)), and mammary 
gland-specific promoters iis.g., milk whey promoter; U.S. Patent No. 4,873,316 and European 
Application Publication No. 264,166). Developmentally-regulated promoters are sometimes utilized, for 
example, the murine hox promoters (Kessel & Gruss, Science 249: 374-379 (1990)) and the a- 
fetopolypeptide promoter (Campes & Tilghman, Genes Dev. 3: 537-546 (1989)). 

[0053] APSMBl. TBP. PDCD2, ELP3, CHDCl or ERG nucleic acid also may be cloned into an 
expression vector in an antisense orientation. Regulatory sequences {e.g., viral promoters and/or 
enhancers) operatively linked to a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleic acid cloned in 
the antisense orientation can be chosen for directing constitutive, tissue specific or cell type specific 
expression of antisense RNA in a variety of cell types. Antisense expression vectors can be in the form 
of a recombinant plasinid, phagemid or attenuated vims. For a discussion of the regulation of gene 
expression using antisense genes see, e.g., Weintraub et al., Antisense RNA as a molecular tool for 
genetic analysis. Reviews - Trends in Genetics, Vol. 1(1) (1986). 

[0054] Also provided herein are host cells that include a PSMBl, TBP. PDCD2, ELP3, CHDCl or 
ERG nucleotide sequence within a recombinant expression vector or a fragment of such a nucleotide 
sequence which facilitate homologous recombination into a specific site of the host cell genome. The 
terms "host cell" and "recombinant host cell" are used interchangeably herein. Such terms refer not only 
to the particular subject cell but rather also to the progeny or potential progeny of such a cell. Because 
certain modifications may occur in succeeding generations due to either mutation or environmental 
influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the 
scope of the term as used herein. A host cell can be any prokaryotic or eukaryotic cell. For example, a 
target polypeptide can be expressed in bacterial cells such as E. coli, insect cells, yeast or tiuimmalian 
cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to 
those skilled in the art. 
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[0055] Vectors can be introduced into host cells via conventional transformation or transfection 
techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a 
variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co-precipitation, transduction/infection, DEAE-dextran- 
mediated transfection, lipofection, or electroporation. 

[0056] A host cell provided herein can be used to produce (i.e., express) a target polypeptide or a 
substantially identical polypeptide thereof Accordingly, ftirther provided are methods for producing a 
target polypeptide using host cells described herein. In one embodiment, the method includes culturing 
host cells into which a recombinant expression vector encoding a target polypeptide has been introduced 
in a suitable medium such that a target polypeptide is produced. In another embodiment, the method 
fiarther includes isolating a target polypeptide from the medium or the host cell. 

[0057] Also provided are cells or purified preparations of cells which include a PSMBl, TBP, 
PDCD2, ELP3, CHDCl or ERG transgene, or which otherwise misexpress target polypeptide. Cell 
preparations can consist of himian or non-human cells, e.g., rodent cells, e.g., mouse or rat cells, rabbit 
cells, or pig cells. In preferred embodiments, the cell or cells include a PSMBl, TBP, PDCD2, ELP3, 
CHDCl or ERG transgene (e.g., a heterologous form of a PSMBl, TBP, PDCD2, ELP3. CHDCl or ERG 
gene, such as a human gene expressed in non-human cells). The transgene can be misexpressed, e.g., 
overexpressed or mderexpressed. In other preferred embodiments, the cell or cells include a gene which 
misexpress an endogenous target polypeptide (e.g., expression of a gene is disrupted, also known as a 
knockout). Such cells can serve as a model for studying disorders which are related to mutated or mis- 
expressed alleles or for use in drug screening. Also provided are hiunan cells (e.g., a hematopoietic stem 
cells) transfected with a PSMBl, TBP, PDCD2, ELP3. CHDCl or ERG nucleic acid. 

[0058] Also provided are cells or a purified preparation thereof (e.g. , human cells) in which an 
endogenous PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleic acid is under the control of a 
regulatory sequence that does not normally control the expression of the endogenous gene. The 
expression characteristics of an endogenous gene within a cell (e.g., a cell line or microorganism) can be 
modified by inserting a heterologous DNA regulatory element into the genome of the cell such that the 
inserted regulatory element is operably linked to the corresponding endogenous gene. For example, an 
endogenous corresponding gene (e.g. , a gene which is "transcriptionally silent," not normally expressed, 
or expressed only at very low levels) may be activated by inserting a regulatory element which is capable 
of promoting the expression of a normally expressed gene product in that cell. Techniques such as 
targeted homologous recombinations, can be used to insert the heterologous DNA as described in, e.g., 
Chappel, US 5,272,071; WO 91/06667, published on May 16, 1991. 
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Transgenic Animals 

[0059] Non-human transgenic animals that express a heterologous target polypeptide ie.g., 
expressed from a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleic acid or substantially identical 
sequence thereof) can be generated. Such animals are usefiil for studying the function and/or activity of a 
target polypeptide and for identifying and/or evaluating modulators of the activity of a PSMBl, TBP. 
PDCD2, ELP3, CHDCl or ERG nucleic acid or encoded polypeptide. As used herein, a "transgenic 
animal" is a non-human animal such as a mammal {e.g., a non-human primate such as chimpanzee, 
baboon, or macaque; an ungulate such as an equine, bovine, or caprine; or a rodent such as a rat, a mouse, 
or an Israeli sand rat), a bird (e.g., a chicken or a turkey), an amphibian (e.g., a frog, salamander, or 
newt), or an insect (e.g., Drosophila melanogaster), in which one or more of the cells of the animal 
includes a transgene. A transgene is exogenous DNA or a rearrangement (e.g., a deletion of endogenous 
chromosomal DNA) that is often integrated into or occurs in the genome of cells in a transgenic animal. 
A transgene can direct expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal, and other transgenes can reduce expression (e.g., a knockout). Thus, a transgenic 
animal can be one in which an endogenous nucleic acid homologous to a PSMBl, TBP, PDCD2, ELP3, 
CHDCl or ERG nucleic acid has been altered by homologous recombination between the endogenous 
gene and an exogenous DNA molecule introduced into a cell of the animal (e.g., an embryonic cell of the 
animal) prior to development of the animal. 

[00601 Intronic sequences and polyadenylation signals can also be included in the transgene to 
increase expression efficiency of the transgene. One or more tissue-specific regulatory sequences can be 
operably linked to a PSMBl, TBP, PDCDZ, ELP3, CHDCl or ERG nucleotide sequence to direct 
expression of an encoded polypeptide to particular cells. A transgenic founder animal can be identified 
based upon the presence of a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence in its 
genome and/or expression of encoded mRNA in tissues or cells of the animals. A transgenic founder 
animal can then be used to breed additional animals carrying the transgene. Moreover, transgenic 
animals carrying a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence can further be 
bred to other transgenic animals carrying other transgenes. 

[0061] Target polypeptides can be expressed in transgenic animals or plants by introducing, for 
example, a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleic acid into the genome of an animal that 
encodes the target polypeptide. In preferred embodiments the nucleic acid is placed under the control of 
a tissue specific promoter, e.g. , a milk or egg specific promoter, and recovered from the milk or eggs 
produced by the animal. Also included is a population of cells from a transgenic animal. 
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Target Polypeptides 

[0062] Also featured herein are isolated target polypeptides, which are encoded by a PSMBl, TBP, 
PDCD2, ELP3, CHDCl or ERG nucleotide sequence (e.g., SEQ ID NO: 1-12), or a substantially 
identical nucleotide sequence thereof. Examples of PSMBl. TBP. PDCD2, ELP3, CHDCl or ERG 
polypeptides are set forth in SEQ ID NO: 13-20. The term "polj^eptide" as used herein includes 
proteins and peptides. An "isolated" or "purified" polypeptide or protein is substantially free of cellular 
material or other contaminating proteins fi-om the cell or tissue source fi-om which the protein is derived, 
or substantially free from chemical precursors or other chemicals when chemically synthesized. In one 
embodiment, the language "substantially free" means preparation of a target polypeptide having less than 
about 30%, 20%, 10% and more preferably 5% (by dry weight), of non-target polypeptide (also referred 
to herein as a "contaminating protein"), or of chemical precursors or non-target chemicals. When the 
target polypeptide or a biologically active portion thereof is recombinantly produced, it is also preferably 
substantially free of culture medium, specifically, where culture medium represents less than about 20%, 
sometimes less than about 10%, and often less than about 5% of the volume of the polypeptide 
preparation. Isolated or purified target polypeptide preparations are sometimes 0.01 milligrams or more 
or 0.1 milligrams or more, and often 1.0 milligrams or more and 10 milligrams or more in dry weight. 

[0063] Further included herein are target polypeptide fragments. The polypeptide fragment may be 
a domain or part of a domain of a target polypeptide. The polypeptide fragment may have increased, 
decreased or unexpected biological activity. The polypeptide fragment is often 50 or fewer, 100 or fewer, 
or 200 or fewer amino acids in length, and is sometimes 300, 400, 500, 600, 700, or 900 or fewer amino 
acids in length. Certain embodiments are directed to PSMBl polypeptide fragments (e.g., sequence accessed 
by NP_002784; rs756519 in Table A), such as a proteasome protease domain (e.g., starting at about amino 
acid 34 and ending at about amino acid 226) or a proteasome B domain (e.g., starting at about amino acid 41 
and ending at about amino acid 88). 

[0064] Substantially identical target polypeptides may depart from the amino acid sequences of 
target polypeptides in different manners. For example, conservative amino acid modifications may be 
introduced at one or more positions in the amino acid sequences of target polypeptides. A "conservative 
amino acid substitution" is one in which the amino acid is replaced by another amino acid having a 
similar structure and/or chemical fixnction. Femiilies of amino acid residues having similar structures and 
fimctions are well known. These families include amino acids with basic side chains {e.g., lysine, 
arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains 
{e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains {e.g., 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side 
chains {e.g., threonine, valine, isoleucine) and aromatic side chains {e.g., tyrosine, phenylalanine, 
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tryptophan, histidine). Also, essential and non-essential amino acids may be replaced. A "non-essential" 
amino acid is one that can be altered without abolishing or substantially altering the biological fimction 
of a target polypeptide, whereas altering an "essential" amino acid abolishes or substantially alters the 
biological function of a target polypeptide. Amino acids that are conserved among target polypeptides 
are typically essential amino acids. In certain embodiments, the polypeptide includes one or more non- 
synonymous polymorphic variants associated with osteoarthritis. 

[0065] Also, target polypeptides may exist as chimeric or fiision polypeptides. As used herein, a 
target "chimeric polypeptide" or target "fusion polypeptide" includes a target polypeptide linked to a 
non-target polypeptide. A "non-target polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a polypeptide which is not substantially identical to the target polypeptide, 
which includes, for example, a polypeptide that is different from the target polypeptide and derived from 
the same or a different organism. The target polypeptide in the fusion polypeptide can correspond to an 
entire or nearly entire target polypeptide or a fragment thereof. The non-target polypeptide can be fiised 
to the N-terminus or C-terminus of the target polypeptide. 

[0066] Fusion polypeptides can include a moiety having high affinity for a ligand. For example, the 
fiision polypeptide can be a GST-target fusion polypeptide in which the target sequences are fused to the 
C-terminus of the GST sequences, or a polyhistidine-target fusion polypeptide in which the target 
polypeptide is fiised at the N- or C-terminus to a string of histidine residues. Such fusion polypeptides 
can facilitate purification of recombinant target polypeptide. Expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide), and a nucleotide sequence in 
SEQ ID NO: 1-12, or a substantially identical nucleotide sequence thereof, can be cloned into an 
expression vector such that the fusion moiety is linked in-frame to the target polypeptide. Further, the 
fusion polypeptide can be a target polypeptide containing a heterologous signal sequence at its N- 
terminus. In certain host cells (e.g., mammalian host cells), expression, secretion, cellular internalization, 
and cellular localization of a target polypeptide can be increased through use of a heterologous signal 
sequence. Fusion polypeptides can also include all or a part of a serum polypeptide (e.g., an IgG constant 
region or human serum albxmiin). 

[0067] Target polypeptides can be incorporated into pharmaceutical compositions and administered 
to a subject in vivo. Administration of these target polypeptides can be used to affect the bioavailability 
of a substrate of the target polypeptide and may effectively increase target polypeptide biological activity 
in a cell. Target fiision polypeptides may be useful therapeutically for the treatment of disorders caused 
by, for example, (i) aberrant modification or mutation of a gene encoding a target polypeptide; (ii) mis- 
regulation of the gene encoding the target polypeptide; and (iii) aberrant post-translational modification 
of a target polypeptide. Also, target polypeptides can be used as immxmogens to produce anti-target 
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antibodies in a subject, to purify target polypeptide ligands or binding partners, and in screening assays to 
identify molecules which inhibit or enhance the interaction of a target polypeptide with a substrate. 

[0068] In addition, polypeptides can be chemically synthesized using techniques known in the art 
(See, e.g., Creighton, 1983 Proteins. New York, N.Y.: W. H. Freeman and Company; and Hunkapiller et 
al., (1984) Nature July 12 -18;310(5973):105-1 1). For example, a relative short fragment can be 
synthesized by use of a peptide synthesizer. Furthermore, if desired, non-classical amino acids or 
chemical amino acid analogs can be introduced as a substitution or addition into the fragment sequence. 
Non-classical amino acids include, but are not limited to, to the D-isomers of the common amino acids, 
2,4-diaminobutjaic acid, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, g- 
Abu, e-Ahx, 6 -amino hexanoic acid, Aib, 2-amino isobutyric acid, 3 -amino propionic acid, ornithine, 
norleucine, norvaline, hydroxyproline, sarcosine, citruUine, homocitruUine, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalamine, b-alanine, fluoroamino acids, designer amino acids such 
as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, £ind amino acid einalogs in 
general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary). 

[0069] Polypeptides and polypeptide fragments sometimes are differentially modified during or after 
translation, e.g., by glycosylation, acetylation, phosphorylation, amidation, derivatization by known 
protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, 
etc. Any of numerous chemical modifications may be carried out by known techniques, including but not 
limited, to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, 
NaBH4; acetylation, formylation, oxidation, reduction; metabolic synthesis in the presence of 
tunicamycin; and the like. Additional post-translational modifications include, for example, N-linked or 
O-linked carbohydrate chains, processing of N-terminal or C-terminal ends), attachment of chemical 
moieties to the amino acid backbone, chemical modifications of N-linked or O-linked carbohydrate 
chains, and addition or deletion of an N-terminal methionine residue as a result of prokaryotic host cell 
expression. The polypeptide fi:agments may also be modified with a detectable label, such as an 
enzymatic, fluorescent, isotopic or affinity label to allow for detection and isolation of the polj^jeptide. 

[0070] Also provided are chemically modified derivatives of polypeptides that can provide 
additional advantages such as increased solubility, stability and circulating time of the polypeptide, or 
decreased immunogenicity (see e.g., U.S. Pat. No: 4,179,337. The chemical moieties for derivitization 
may be selected firom water soluble polymers such as polyethylene glycol, ethylene glycol/propylene 
glycol copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the like. The polypeptides 
may be modified at random positions within the molecule, or at predetermined positions within the 
molecule and may include one, two, three or more attached chemical moieties. 

[0071] The polymer may be of any molecular weight, and may be branched or unbranched. For 
polyethylene glycol, the preferred molecular weight is between about 1 kDa and about 100 kDa (the term 
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"about" indicating that in preparations of polyethylene glycol, some molecules will weigh more, some 
less, than the stated molecular weight) for ease in handling and manufacturing. Other sizes may be used, 
depending on the desired therapeutic profile (e.g., the duration of sustained release desired, the effects, if 
any on biological activity, the ease in handling, the degree or lack of antigenicity and other known effects 
of the polyethylene glycol to a therapeutic protein or analog). 

[0072] The polymers should be attached to the polypeptide with consideration of effects on 
fiinctional or antigenic domains of the polypeptide. There are a number of attachment methods available 
to those skilled in the art (e.g., EP 0 401 384 (coupling PEG to G-CSF) and Malik et al. (1992) Exp 
Hematol. September;20(8): 1028-35 (pegylation of GM-CSF using tresyl chloride)). For example, 
polyethylene glycol may be covalently bound through amino acid residues via a reactive group, such as a 
free amino or carboxyl group. Reactive groups are those to which an activated polyethylene glycol 
molecule may be bound. The amino acid residues having a free amino group may include lysine residues 
and the N-terminal amino acid residues; those having a free carboxyl group may include aspartic acid 
residues, glutamic acid residues and the C-terminal amino acid residue. Sulfhydryl groups may also be 
used as a reactive group for attaching the polyethylene glycol molecules. For therapeutic purposes, the 
attachment sometimes is at an amino group, such as attachment at the N-terminus or lysine group. 

[0073] Proteins can be chemically modified at the N-terminus. Using polyethylene glycol as an 
illustration of such a composition, one may select from a variety of polyethylene glycol molecules (by 
molecular weight, branching, and the like), the proportion of polyethylene glycol molecules to protein 
(polypeptide) molecules in the reaction mix, the type of pegylation reaction to be performed, and the 
method of obtaining the selected N-terminally pegylated protein. The method of obtaining the N- 
terminally pegylated preparation (i.e., separating this moiety from other monopegylated moieties if 
necessary) may be by purification of the N-terminally pegylated material from a population of pegylated 
protein molecules. Selective proteins chemically modified at the N-terminus may be accomplished by 
reductive alkylation, which exploits differential reactivity of different types of primjiry amino groups 
(lysine versus the N-terminal) available for derivatization in a particular protein. Under the appropriate 
reaction conditions, substantially selective derivatization of the protein at the N-terminus with a carbonyl 
group containing polymer is achieved. 

Substantially Identical Nucleic Ac ids and Polypeptides 

[0074] Nucleotide sequences and polypeptide sequences that are substantially identical to a PSMBl, 
TBP, PDCD2. ELP3, CHDCl or ERG nucleotide sequence and the target polypeptide sequences encoded 
by those nucleotide sequences, respectively, are included herein. The term "substantially identical" as 
used herein refers to two or more nucleic acids or polypeptides sharing one or more identical nucleotide 
sequences or polypeptide sequences, respectively. Included are nucleotide sequences or polypeptide 
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sequences that are 55% or more, 60% or more, 65% or more, 70% or more, 75% or more, 80% or more, 
85% or more, 90% or more, 95%> or more (each often within a 1%, 2%, 3% or 4% variabiUty) identical to 
a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence or the encoded target polypeptide 
amino acid sequences. One test for determining whether two nucleic acids are substantially identical is to 
determine the percent of identical nucleotide sequences or polypeptide sequences shared between the 
nucleic acids or polypeptides. 

[0075] Calculations of sequence identity are often performed as follows. Sequences are aligned for 
optimal comparison purposes {e.g., gaps can be introduced in one or both of a first and a second amino 
acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded 
for comparison purposes). The length of a reference sequence aligned for comparison purposes is 
sometimes 30% or more, 40%) or more, 50%. or more, often 60%. or more, and more often 70%. or more, 
80% or more, 90% or more, or 100% of the length of the reference sequence. The nucleotides or amino 
acids at corresponding nucleotide or polypeptide positions, respectively, are then compared among the 
two sequences. When a position in the first sequence is occupied by the same nucleotide or amino acid 
as the corresponding position in the second sequence, the nucleotides or amino acids are deemed to be 
identical at that position. The percent identity between the two sequences is a function of the number of 
identical positions shared by the sequences, taking into accoimt the number of gaps, and the length of 
each gap, introduced for optimal edignment of the two sequences. 

[0076] Comparison of sequences and determination of percent identity between two sequences can 
be accomplished using a mathematical algorithm. Percent identity between two amino acid or nucleotide 
sequences can be determined using the algorithm of Meyers «& Miller, CABIOS 4: 11-17 (1989), which 
has been incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a 
gap length penalty of 12 and a gap penalty of 4. Also, percent identity between two amino acid 
sequences can be determined using the Needleman & Wimsch, J. Mol. Biol. 48: 444-453 (1970) 
algorithm which has been incorporated into the GAP program in the GCG software package (available at 
the http address www.gcg.com), using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight 
of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. Percent identity between two 
nucleotide sequences can be determined using the GAP program in the GCG software package (available 
at http address www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 
80 and a length weight of 1, 2, 3, 4, 5, or 6. A set of parameters often used is a Blossimi 62 scoring 
matrix with a gap open penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5. 

[0077] Another manner for determining if two nucleic acids are substantially identical is to assess 
whether a polynucleotide homologous to one nucleic acid will hybridize to the other nucleic acid under 
stringent conditions. As use herein, the term "stringent conditions" refers to conditions for hybridization 
and washing. Stringent conditions are known to those skilled in the art and can be foimd in Current 
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Protocols in Molecular Biology, John Wiley & Sons, N.Y. , 6.3.1-6.3.6 (1989). Aqueous and non- 
aqueous methods are described in that reference and either can be used. An example of stringent 
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 50°C. Another example of stringent 
hybridization conditions are hybridization in 6X sodiimi chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 55°C. A ftirther example of stringent 
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 60°C. Often, stringent hybridization 
conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45''C, followed by one 
or more washes in 0.2X SSC, 0.1% SDS at 65°C. More often, stringency conditions are 0.5M sodium 
phosphate, 7% SDS at 65°C, followed by one or more washes at 0.2X SSC, 1% SDS at 65°C. 

[0078] An example of a substantially identical nucleotide sequence to a nucleotide sequence in SEQ 
ID NO: 1-12 is one that has a different nucleotide sequence but still encodes the same polypeptide 
sequence encoded by the nucleotide sequence in SEQ ID NO: 1-12. Another example is a nucleotide 
sequence that encodes a polypeptide having a polypeptide sequence that is more than 70% or more 
identical to, sometimes more than 75% or more, 80% or more, or 85% or more identical to, and often 
more than 90% or more and 95% or more identical to a polypeptide sequence encoded by a nucleotide 
sequence in SEQ ID NO: 1-12. 

[0079] Nucleotide sequences in SEQ ID NO: 1-12 and amino acid sequences of encoded 
poljTJeptides can be used as "query sequences" to perform a search against public databases to identify 
other family members or related sequences, for example. Such searches can be performed using the 
NBLAST and XBLAST programs (version 2.0) of Altschul et al. J. Mol. Biol. 215: 403-10 (1990). 
BLAST nucleotide searches can be performed with the NBLAST program, score = 100, wordlength = 12 
to obtain nucleotide sequences homologous to nucleotide sequences in SEQ ID NO: 1-12. BLAST 
polypeptide searches can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain 
amino acid sequences homologous to polypeptides encoded by the nucleotide sequences of SEQ ID NO: 
1-12. To obtain gapped alignments for comparison purposes. Gapped BLAST can be utilized as 
described in Altschul et al.. Nucleic Acids Res. 25(17): 3389-3402 (1997). When utilizing BLAST and 
Gapped BLAST programs, default parameters of the respective programs {e.g., XBLAST and NBLAST) 
can be used {see the http address www.ncbi.nlm.nih.gov). 

[0080] A nucleic acid that is substantially identical to a nucleotide sequence in SEQ ID NO: 1 may 
include polymorphic sites at positions equivalent to those described herein when the sequences are 
aligned. For example, using the aUgnment procediires described herein, SNPs in a sequence substantially 
identical to a sequence in SEQ ID NO: 1-12 can be identified at nucleotide positions that match {i.e., 
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align) with nucleotides at SNP positions in each nucleotide sequence in SEQ ED NO: 1-12. Also, where a 
polymorphic variation results in an insertion or deletion, insertion or deletion of a nucleotide sequence 
from a reference sequence can change the relative positions of other polymorphic sites in the nucleotide 
sequence. 

[00811 Substantially identical nucleotide and polypeptide sequences include those that are naturally 
occurring, such as allelic variants (same locus), splice variants, homologs (different locus), and orthologs 
(different organism) or can be non-naturally occurring. Non-naturally occurring variants can be 
generated by mutagenesis techniques, including those applied to polynucleotides, cells, or organisms. 
The variants can contain nucleotide substitutions, deletions, inversions and insertions. Variation can 
occur in either or both the coding and non-coding regions. The variations can produce both conservative 
and non-conservative amino acid substitutions (as compared in the encoded product). Orthologs, 
homologs, allelic variants, and splice variants can be identified using methods known in the art. These 
variants normally comprise a nucleotide sequence encoding a polypeptide that is 50% or more, about 
55% or more, often about 70-75% or more or about 80-85% or more, and sometimes about 90-95% or 
more identical to the amino acid sequences of target polypeptides or a fragment thereof. Such nucleic 
acid molecules can readily be identified as being able to hybridize imder stringent conditions to a 
nucleotide sequence in SEQ ID NO: 1-12 or a fragment of this sequence. Nucleic acid molecules 
corresponding to orthologs, homologs, and allelic variants of a nucleotide sequence in SEQ ID NO: 1-12 
can ftiTther be identified by mapping the sequence to the same chromosome or locus as the nucleotide 
sequence in SEQ ID NO: 1-12. 

[0082] Also, substantially identical nucleotide sequences may include codons that are altered with 
respect to the naturally occurring sequence for enhancing expression of a target polypeptide in a 
particular expression system. For example, the nucleic acid can be one in which one or more codons are 
altered, and often 10% or more or 20% or more of the codons are altered for optimized expression in 
bacteria (e.g., E. coli.), yeast (e.g., S. cervesiae), himian (e.g., 293 cells), insect, or rodent (e.g., hamster) 
cells. 

Methods for Identifying Risk of osteoarthritis 

[0083] Methods for prognosing and diagnosing osteoarthritis are included herein. These methods 
include detecting the presence or absence of one or more polymorphic variations in a nucleotide sequence 
associated with osteoarthritis, such as variants in or around the loci set forth herein, or a substantially 
identical sequence thereof, in a sample from a subject, where the presence of a polymorphic variant 
described herein is indicative of a risk of osteoarthritis. Determining a risk of osteoarthritis sometimes 
refers to determining whether an individual is at an increased risk of osteoarthritis (e.g., intermediate risk 
or higher risk). 
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[0084] Thus, featured herein is a method for identifying a subject who is at risk of osteoarthritis, 
which comprises detecting an aberration associated with osteoarthritis in a nucleic acid sample from the 
subject. An embodiment is a method for detecting a risk of osteoarthritis in a subject, which comprises 
detecting the presence or absence of a polymorphic variation associated with osteoarthritis at a 
polymorphic site in a nucleotide sequence m a nucleic acid sample from a subject, where the nucleotide 
sequence comprises a polynucleotide sequence selected from the group consisting of: (a) a nucleotide 
sequence of SEQ ID NO: 1-12; (b) a nucleotide sequence which encodes a polypeptide consisting of an 
amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1-12; (c) a nucleotide sequence 
which encodes a polypeptide that is 90% or more identical to an amino acid sequence encoded by a 
nucleotide sequence of SEQ ID NO: 1-12, or a nucleotide sequence about 90% or more identical to a 
nucleotide sequence of SEQ ID NO: 1-12; and (d) a fragment of a nucleotide sequence of (a), (b), or (c) 
comprising the polymorphic site; whereby the presence of the polymorphic variation is indicative of a 
predisposition to osteoarthritis in the subject. In certain embodiments, polymorphic variants at the 
positions described herein eire detected for determining a risk of osteoarthritis, and polymorphic variants 
at positions in linkage disequilibrium with these positions are detected for determining a risk of 
osteoarthritis. As used herein, "SEQ ID NO: 1-12" refers to individual sequences of SEQ ID NO: 1, 2, 3, 
4, 5, 6, 7, 8, 9, 10, 1 1 or 12, each sequence being separately applicable to embodiments described herein. 

[0085] Risk of osteoarthritis sometimes is expressed as a probability, such as an odds ratio, 
percentage, or risk factor. Risk often is based upon the presence or absence of one or more polymorphic 
variants described herein, and also may be based in part upon phenotypic traits of the individual being 
tested. Methods for calculating risk based upon patient data are well known (see, e.g., Agresti, 
Categorical Data Analysis, 2nd Ed. 2002. Wiley). Allelotyping and genotyping analyses may be carried 
out in populations other than those exemplified herein to enhance the predictive power of the prognostic 
method. These fiuther analyses are executed in view of the exemplified procedures described herein, and 
may be based upon the same polymorphic variations or additional polymorphic variations. 

[0086] In certain embodiments, determining the presence of a combination of two or more 
polymorphic variants associated with osteoarthritis in one or more genetic loci (e.g., one or more genes) 
of the sample is determined to identify, quantify and/or estimate, risk of osteoarthritis. The risk often is 
the probability of having or developing osteoarthritis. The risk sometimes is expressed as a relative risk 
with respect to a population average risk of osteoarthritis, and sometimes is expressed as a relative risk 
with respect to the lowest risk group. Such relative risk assessments often are based upon penetrance 
values determined by statistical methods, and are particularly useful to clinicians and insurance 
companies for assessing risk of osteoarthritis (e.g., a clinician can target appropriate detection, prevention 
and therapeutic regimens to a patient after determining the patient's risk of osteoarthritis, and an 
insurance company can fine time actuarial tables based upon population genotype assessments of 
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osteoarthritis risk). Risk of osteoarthritis sometimes is expressed as an odds ratio, which is the odds of a 
particular person having a genotype has or will develop osteoarthritis with respect to another genotype 
group (e.g., the most disease protective genotype or population average). In related embodiments, the 
determination is utilized to identify a subject at risk of osteoarthritis. In an embodiment, two or more 
polymorphic variations are detected in two or more regions in human genomic DNA associated with 
increased risk of osteoarthritis, such as a locus containing a PSMBl, TBP, PDCD2, ELP3, CHDCl or 
ERG, for example. In certain embodiments, 3 or more, or 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 
18, 19 or 20 or more polymorphic variants are detected in the sample. In specific embodiments, 
polymorphic variants are detected in a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG region, for 
example. In another embodiment, polymorphic variants are detected at two or more positions selected 
from the group consisting of rs756519, rsl042327, rs8770, rsl563055, rs912428 and rsl888475; 229, 
16559, 44850 and/or 46252 in SEQ ID NO: 1; 40985, 46168, 51925 and/or 52168 in SEQ ID NO: 2; 
49693, 82475, 92462 and/or 96275 in SEQ ID NO: 3; and 33137, 55499 and/or 58282 in SEQ ID NO: 4. 
In certain embodiments, polymorphic variants are detected at other genetic loci (e.g., the polymorphic 
variants can be detected in TBP, PDCD2, ELP3, CHDCl or ERG in addition to other loci or 

only in other loci), where the other loci include but are not limited to those described in concurrently- 
filed patent applications having attorney docket number 524593008800, 524593008900, 524593009000, 
524593009100 or 524593009200, each of which is incorporated herein by reference in its entirety. 

[0087] Results from prognostic tests may be combined with other test results to diagnose 
osteoarthritis. For example, prognostic results may be gathered, a patient sample may be ordered based 
on a determined predisposition to osteoarthritis, the patient sample is analyzed, and the results of the 
analysis may be utilized to diagnose osteoarthritis. Also osteoarthritis diagnostic method can be 
developed from studies used to generate prognostic methods in which populations are stratified into 
subpopulations having different progressions of osteoarthritis. In another embodiment, prognostic results 
may be gathered, a patient's risk factors for developing osteoarthritis {e.g., age, weight, race, diet) 
analyzed, eind a patient sample may be ordered based on a determined predisposition to osteoarthritis. 

[0088] The nucleic acid sample typically is isolated from a biological sample obtained from a 
subject. For example, nucleic acid can be isolated from blood, saliva, sputum, urine, cell scrapings, and 
biopsy tissue. The nucleic acid sample can be isolated from a biological sample using standard 
techniques, such as the technique described in Example 2. As used herein, the term "subject" refers 
primarily to hvmians but also refers to other mammals such as dogs, cats, and imgulates {e.g., cattle, 
sheep, and swine). Subjects also include avians {e.g., chickens and turkeys), reptiles, and fish {e.g., 
salmon), as embodiments described herein can be adapted to nucleic acid samples isolated from any of 
these organisms. The nucleic acid sample may be isolated from the subject and then directly utilized in a 
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method for determining the presence of a polymorphic variant, or alternatively, the sample may be 
isolated and then stored (e.g., frozen) for a period of time before being subjected to analysis. 

[0089] The presence or absence of a polymorphic variant is determined using one or both 
chromosomal complements represented in the nucleic acid sample. Determining the presence or absence 
of a polymorphic variant in both chromosomal complements represented in a nucleic acid sample from a 
subject having a copy of each chromosome is usefiil for determining the zygosity of an individual for the 
polymorphic variant (i.e., whether the individual is homozygous or heterozygous for the polymorphic 
variant). Any oligonucleotide-based diagnostic may be utilized to determine whether a sample includes 
the presence or absence of a polymorphic variant in a sample. For example, primer extension methods, 
ligase sequence determination methods (e.g., U.S. Pat. Nos. 5,679,524 and 5,952,174, and WO 
01/27326), mismatch sequence determination methods (e.g., U.S. Pat. Nos. 5,851,770; 5,958,692; 
6,110,684; and 6,183,958), microarray sequence determination methods, restriction fragment length 
polymorphism (RFLP), single strand conformation polymorphism detection (SSCP) (e.g., U.S. Pat. Nos. 
5.891,625 and 6,013,499), PCR-based assays (e.g., TAQMAN® PCR System (Applied Biosystems)), and 
nucleotide sequencing methods may be used. 

[0090] Oligonucleotide extension methods typically involve providing a pair of oligonucleotide 
primers in a polymerase chain reaction (PCR) or in other nucleic acid amplification methods for the 
purpose of amplifying a region from the nucleic acid sample that comprises the polymorphic variation. 
One oligonucleotide primer is complementary to a region 3 ' of the polymorphism and the other is 
complementsiry to a region 5' of the polymorphism. A PCR primer pair may be used in methods 
disclosed in U.S. Pat. Nos. 4,683,195; 4,683,202, 4,965.188; 5,656,493; 5.998,143; 6,140,054; WO 
01/27327; and WO 01/27329 for example. PCR primer pairs may also be used in any commercially 
available machines that perform PCR, such as any of the GENEAMP® Systems available from Applied 
Biosystems. Also, those of ordinary skill in the art will be able to design oligonucleotide primers based 
upon a PSMBl. TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence using knowledge available in 
the art. 

[0091] Also provided is an extension oligonucleotide that hybridizes to the amplified fiagment 
adjacent to the polymorphic variation. As used herein, the term "adjacent" refers to the 3' end of the 
extension oHgonucleotide being often 1 nucleotide from the 5' end of the polymorphic site, and 
sometimes 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides from the 5' end of the polymorphic site, in the nucleic 
acid when the extension oligonucleotide is hybridized to the nucleic acid. The extension oligonucleotide 
then is extended by one or more nucleotides, and the number and/or type of nucleotides that are added to 
the extension oligonucleotide determine whether the polymorphic variant is present. Oligonucleotide 
extension methods are disclosed, for example, in U.S. Pat. Nos. 4,656,127; 4,851,331; 5,679,524; 
5,834,189; 5,876,934; 5,908,755; 5,912,118; 5,976,802; 5,981,186; 6,004,744; 6,013,431; 6,017,702; 
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6,046,005; 6,087,095; 6,210,891; and WO 01/20039. Oligonucleotide extension methods using mass 
spectrometry are described, for example, in U.S. Pat. Nos. 5,547,835; 5,605,798; 5,691 ,141; 5,849,542; 
5,869,242; 5,928,906; 6,043,031; and 6,194,144, and a method often utilized is described herein in 
Example 2. 

[0092] A microarray can be utilized for determining whether a polymorphic variant is present or 
absent in a nucleic acid sample. A microarray may include any oligonucleotides described herein, and 
methods for making and using oligonucleotide microarrays suitable for diagnostic use are disclosed in 
U.S. Pat. Nos. 5,492,806; 5,525,464; 5,589,330; 5,695,940; 5,849,483; 6,018,041; 6,045,996; 6,136,541; 
6,142,681; 6,156,501; 6,197,506; 6,223,127; 6,225,625; 6,229,91 1 ; 6,239,273; WO 00/52625; WO 
01/25485; and WO 01/29259. The microarray typically comprises a solid support and the 
oligonucleotides may be linked to this sohd support by covalent bonds or by non-covalent interactions. 
The oligonucleotides may also be linked to the solid support directly or by a spacer molecule. A 
microarray may comprise one or more oligonucleotides complementary to a polymorphic site set forth 
herein. 

[0093] A kit also may be utilized for determining whether a polymorphic variant is present or absent 
in a nucleic acid sample. A kit often comprises one or more pairs of oligonucleotide primers useftil for 
amplifying a fi-agment of a nucleotide sequence of SEQ ID NO: 1-12 or a substantially identical sequence 
thereof, where the fragment includes a poljmiorphic site. The kit sometimes comprises a polymerizing 
agent, for example, a thermostable nucleic acid polymerase such as one disclosed in U.S. Pat. Nos. 
4,889,818 or 6,077,664. Also, the kit often comprises an elongation oligonucleotide that hybridizes to a 
PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence in a nucleic acid sample adjacent to 
the polymorphic site. Where the kit includes an elongation oligonucleotide, it also often comprises chain 
elongating nucleotides, such as dATP, dTTP, dGTP, dCTP, and dlTP, including analogs of dATP, dTTP, 
dGTP, dCTP and dITP, provided that such analogs are substrates for a thermostable nucleic acid 
polymerase and can be incorporated into a nucleic acid chain elongated firom the extension 
oligonucleotide. Along with chain elongating nucleotides would be one or more chain terminating 
nucleotides such as ddATP, ddTTP, ddGTP, ddCTP, and the like. In an embodiment, the kit comprises 
one or more oligonucleotide primer pairs, a polymerizing agent, chain elongating nucleotides, at least one 
elongation oligonucleotide, and one or more chain terminating nucleotides. Kits optionally include 
buffers, vials, microtiter plates, and instructions for use. 

[0094] An individual identified as being at risk of osteoarthritis may be heterozygous or 
homozygous with respect to the allele associated with a higher risk of osteoarthritis. A subject 
homozygous for an allele associated with an increased risk of osteoarthritis is at a comparatively high 
risk of osteoarthritis, a subject heterozygous for an allele associated with an increased risk of 
osteoarthritis is at a comparatively intermediate risk of osteoarthritis, and a subject homozygous for an 
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allele associated with a decreased risk of osteoarthritis is at a comparatively low risk of osteoarthritis. A 
genotype may be assessed for a complementary strand, such that the complementary nucleotide at a 
particular position is detected. 

[0095] Also featured are methods for determining risk of osteoarthritis and/or identifying a subject 
at risk of osteoarthritis by contacting a polypeptide or protein encoded by a PSMBl, TBP, PDCD2, 
ELF 3, CHDCl or ERG nucleotide sequence from a subject with an antibody that specifically binds to an 
epitope associated with increased risk of osteoarthritis in the polypeptide. 

Applications of Progn ostic and Diagnostic Resuhs to Pharmacogenomic Methods 

[0096] Pharmacogenomics is a discipline that involves tailoring a treatment for a subject according 
to the subject's genotype as a pjirticular treatment regimen may exert a differential effect depending upon 
the subject's genotype. For example, based upon the outcome of a prognostic test described herein, a 
clinician or physician may target pertinent information and preventative or therapeutic treatments to a 
subject who would be benefited by the information or treatment and avoid directing such information and 
treatments to a subject who would not be benefited {e.g., the treatment has no therapeutic effect and/or 
the subject experiences adverse side effects). 

[0097] The following is an example of a pharmacogenomic embodiment. A particular treatment 
regimen can exert a differential effect depending upon the subject's genotype. Where a candidate 
therapeutic exhibits a significant interaction with a major allele and a comparatively weak interaction 
with a minor allele (e.g., an order of magnitude or greater difference in the interaction), such a 
therapeutic typically would not be administered to a subject genotyped as being homozygous for the 
minor allele, and sometimes not administered to a subject genotyped as being heterozygous for the minor 
allele. In another example, where a candidate therapeutic is not significantly toxic when administered to 
subjects who are homozygous for a major allele but is comparatively toxic when administered to subjects 
heterozygous or homozygous for a minor allele, the candidate therapeutic is not typically administered to 
subjects who are genotyped as being heterozygous or homozygous with respect to the minor allele. 

[0098] The methods described herein are appHcable to pharmacogenomic methods for preventing, 
alleviating or treating osteoarthritis. For example, a nucleic acid sample fi-om an individual may be 
subjected to a prognostic test described herein. Where one or more polymorphic variations associated 
with increased risk of osteoarthritis are identified in a subject, information for preventing or treating 
osteoarthritis and/or one or more osteoarthritis treatment regimens then may be prescribed to that subject. 

[0099] In certain embodiments, a treatment or preventative regimen is specifically prescribed and/or 
administered to individuals who will most benefit from it based upon their risk of developing 
osteoarthritis assessed by the methods described herein. Thus, provided are methods for identifying a 
subject predisposed to osteoarthritis and then prescribing a therapeutic or preventative regimen to 
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individuals identified as having a predisposition. Thus, certain embodiments are directed to a method for 
reducing osteoarthritis in a subject, which comprises: detecting the presence or absence of a polymorphic 
variant associated with osteoarthritis in a nucleotide sequence in a nucleic acid sample fi-om a subject, 
where the nucleotide sequence comprises a polynucleotide sequence selected from the group consisting 
of: (a) a nucleotide sequence of SEQ ID NO: 1-12; (b) a nucleotide sequence which encodes a 
polypeptide consisting of an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1- 
12; (c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to an amino acid 
sequence encoded by a nucleotide sequence of SEQ ID NO: 1-12, or a nucleotide sequence about 90% or 
more identical to a nucleotide sequence of SEQ ID NO: 1 -1 2; and (d) a fragment of a polynucleotide 
sequence of (a), (b), or (c); and prescribing or administering a treatment regimen to a subject fi-om whom 
the sample originated where the presence of a polymorphic variation associated with osteoarthritis is 
detected in the nucleotide sequence. In these methods, predisposition results may be utilized in 
combination with other test results to diagnose osteoarthritis. 

[0100] Certain preventative treatments often are prescribed to subjects having a predisposition to 
osteoarthritis and where the subject is diagnosed with osteoarthritis or is diagnosed as having symptoms 
indicative of an early stage of osteoarthritis. The treatment sometimes is preventative (.e.g., is prescribed 
or administered to reduce the probability that osteoarthritis arises or progresses), sometimes is 
therapeutic, and sometimes delays, alleviates or halts the progression of osteoarthritis. Any known 
preventative or therapeutic treatment for zdleviating or preventing the occurrence of osteoarthritis is 
prescribed and/or administered. For example, the treatment often is directed to decreasing pain and 
improving joint movement. Examples of OA treatments include exercises to keep joints flexible and 
improve muscle strength. Different medications to control pain, including corticosteroids and 
nonsteroidal anti-inflammatory drugs (NSAIDs, e.g.. Voltaren); cycIooxygenase-2 (COX-2) inhibitors 
(e.g., Celebrex, Vioxx, Mobic, and Bextra); monoclonal antibodies (e.g., Remicade); timior necrosis 
factor inhibitors (e.g., Enbrel); or injections of glucocorticoids, hyaluronic acid or chondrotin sulfate into 
joints that are inflamed and not responsive to NSAIDS. Orally administered chondroitin sulfate also^inay 
be used as a therapeutic, as it may increeise hyaluronic acid levels and viscosity of synovial fluid, ancf 
decrease coUagenase levels in synovial fluid. Also, glucosamine can serve as an OA therapeutic as 
delivering it into joints may inhibit enzymes involved in cartilage degradation and enhance the 
production of hyaluronic acid. For mild pain without inflammation, acetaminophen may be used. Other 
treatments include: heat/cold therapy for temporary pain relief; joint protection to prevent strain or stress 
on painful joints; surgery to relieve chronic pain in damaged joints; and weight control to prevent extra 
stress on weight-bearing joints. 

[0101] As therapeutic approaches for treating osteoarthritis continue to evolve and improve, the goal 
of treatments for osteoarthritis related disorders is to intervene even before clinical signs first manifest. 
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Thus, genetic markers associated with susceptibility to osteoarthritis prove usefiil for early diagnosis, 
prevention and treatment of osteoarthritis. 

[0102] As osteoarthritis preventative and treatment information can be specifically targeted to 
subjects in need thereof (e.g. , those at risk of developing osteoarthritis or those in an early stage of 
osteoarthritis), provided herein is a method for preventing or reducing the risk of developing 
osteoarthritis in a subject, which comprises: (a) detecting the presence or absence of a polymorphic 
variation associated with osteoarthritis at a polymorphic site in a nucleotide sequence in a nucleic acid 
sample from a subject; (b) identifying a subject with a predisposition to osteoarthritis, whereby the 
presence of the polymorphic variation is indicative of a predisposition to osteoarthritis in the subject; and 
(c) if such a predisposition is identified, providing the subject with information about methods or 
products to prevent or reduce osteoarthritis or to delay the onset of osteoarthritis. Also provided is a 
method of targeting information or advertising to a subpopulation of a human population based on the 
subpopulation being genetically predisposed to a disease or condition, which comprises: (a) detecting the 
presence or absence of a polymorphic variation associated with osteoarthritis at a polymorphic site in a 
nucleotide sequence in a nucleic acid sample from a subject; (b) identifying the subpopulation of subjects 
in which the polymorphic variation is associated with osteoarthritis; and (c) providing information only 
to the subpopulation of subjects about a particular product which may be obtained and consumed or 
applied by the subject to help prevent or delay onset of the disease or condition. 

[0103] Pharmacogenomics methods also may be used to analyze and predict a response to 
osteoarthritis treatment or a drug. For example, if pharmacogenomics analysis indicates a likelihood that 
an individual will respond positively to osteoarthritis treatment with a particular drug, the drug may be 
administered to the individual. Conversely, if the analysis indicates that an individual is likely to respond 
negatively to treatment with a particular drug, an alternative course of treatment may be prescribed. A 
negative response may be defined as either the absence of an efficacious response or the presence of toxic 
side effects. The response to a therapeutic treatment can be predicted in a background study in which 
subjects in any of the following populations are genotyped: a population that responds favorably to a 
treatment regimen, a population that does not respond significantly to a treatment regimen, and a 
population that responds adversely to a treatment regimen (e.g., exhibits one or more side effects). These 
populations are provided as examples and other populations and subpopulations may be analyzed. Based 
upon the results of these analyses, a subject is genotyped to predict whether he or she will respond 
favorably to a treatment regimen, not respond significantly to a treatment regimen, or respond adversely 
to a treatment regimen. 

[0104] The tests described herein also are applicable to clinical drug trials. One or more 
polymorphic variants indicative of response to an agent for treating osteoarthritis or to side effects to an 
agent for treating osteoarthritis may be identified using the methods described herein. Thereafter, 
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potential participants in clinical trials of such an agent may be screened to identify those individuals most 
likely to respond favorably to the drug and exclude those likely to experience side effects. In that way, 
the effectiveness of drug treatment may be measured in individuals who respond positively to the drug, 
without lowering the measurement as a result of the inclusion of individuzds who are unlikely to respond 
positively in the study and without risking undesirable safety problems. 

[0105] Thus, another embodiment is a method of selecting an individual for inclusion in a clinical 
trial of a treatment or drug comprising the steps of: (a) obtaining a nucleic acid sample from an 
individual; (b) determining the identity of a polymorphic variation which is associated with a positive 
response to the treatment or the drug, or at least one polymorphic variation which is associated with a 
negative response to the treatment or the drug in the nucleic acid sample, and (c) including the individual 
in the clinical trial if the nucleic acid sample contains said polymorphic variation associated with a 
positive response to the treatment or the drug or if the nucleic acid sample lacks said polymorphic 
variation associated with a negative response to the treatment or the drug. In addition, the methods 
described herein for selecting an individual for inclusion in a clinical trial of a treatment or drug 
encompass methods with any fiirther limitation described in this disclosure, or those following, specified 
alone or in any combination. The polymorphic variation may be in a sequence selected individually or in 
any combination from the group consisting of (i) a nucleotide sequence of SEQ ID NO: 1 -1 2; (ii) a 
nucleotide sequence which encodes a polypeptide consisting of an amino acid sequence encoded by a 
nucleotide sequence of SEQ ID NO: 1-12; (iii) a nucleotide sequence which encodes a polypeptide that is 
90% or more identical to an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1- 
1 2, or a nucleotide sequence about 90% or more identical to a nucleotide sequence of SEQ ID NO: 1-12; 
and (iv) a fragment of a polynucleotide sequence of (i), (ii), or (iii) comprising the polymorphic site. The 
including step (c) optionedly comprises administering the drug or the treatment to the individual if the 
nucleic acid sample contains the polymorphic variation associated with a positive response to the 
treatment or the drug and the nucleic acid sample lacks said biallelic marker associated with a negative 
response to the treatment or the drug. 

[0106] Also provided herein is a method of partnering between a diagnostic/prognostic testing 
provider and a provider of a constmiable product, which comprises: (a) the diagnostic/prognostic testing 
provider detects the presence or absence of a polymorphic variation associated with osteoarthritis at a 
polymorphic site in a nucleotide sequence in a nucleic acid sample from a subject; (b) the 
diagnostic/prognostic testing provider identifies the subpopulation of subjects in which the polymorphic 
variation is associated with osteoarthritis; (c) the diagnostic/prognostic testing provider forwards 
information to the subpopulation of subjects about a particular product which may be obtained and 
consumed or applied by the subject to help prevent or delay onset of the disease or condition; and (d) the 



31 



PATENT 
Docket 524593008700 



provider of a consumable product forwards to the diagnostic test provider a fee every time the 
diagnostic/prognostic test provider forwards information to the subject as set forth in step (c) above. 

Compositions Comprising Osteoarthritis-Directed Molecules 

[0107] Featiired herein is a composition comprising a cell from a subject having osteoarthritis or at 
risk of osteoarthritis and one or more molecules specifically directed and targeted to a nucleic acid 
comprising a PSMBl, TBP. PDCD2, ELP3, CHDCl or ERG nucleotide sequence or amino acid 
sequence. Such directed molecules include, but are not limited to, a compound that binds to a PSMBl, 
TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence or amino acid sequence referenced herein; a 
RNAi or siRNA molecule having a strand complementary or substantially complementary to a PSMBl, 
TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence (e.g., hybridizes to a PSMBl, TBP, PDCD2, 
ELP3, CHDCl or ERG nucleotide sequence under conditions of high stringency); an antisense nucleic 
acid complementary or substantially complementary to an RNA encoded by a PSMBl, TBP, PDCD2, 
ELP3, CHDCl or ERG nucleotide sequence (e.g., hybridizes to a PSMBl, TBP, PDCD2, ELP3, CHDCl 
or ERG nucleotide sequence under conditions of high stringency); a ribozyme that hybridizes to a 
PSMBl. TBP. PDCD2. ELP3. CHDCl or ERG nucleotide sequence (e.g., hybridizes to a PSMBl, TBP, 
PDCD2. ELP3. CHDCl or ERG nucleotide sequence under conditions of high stringency); a nucleic 
acid aptamer that specifically binds a polypeptide encoded by PSMBl, TBP, PDCD2. ELP3, CHDCl or 

nucleotide sequence; and an antibody that specifically binds to a polypeptide encoded by PSMBl, 
TBP, PDCD2. ELP3. CHDCl or ERG nucleotide sequence or binds to a nucleic acid having such a 
nucleotide sequence. In specific embodiments, the osteoarthritis directed molecule interacts with a 
nucleic acid or polypeptide variant associated with osteoarthritis, such as variants referenced herein. In 
other embodiments, the osteoarthritis directed molecule interacts with a polypeptide involved in a signal 
pathway of a polypeptide encoded by a PSMBl, TBP, PDCD2, ELP3. CHDCl or ERG nucleotide 
sequence, or a nucleic acid comprising such a nucleotide sequence. 

[0108] Compositions sometimes include an adjuvant known to stimulate an immime response, and 
in certain embodiments, an adjuvant that stimulates a T-cell lymphocyte response. Adjuvants are known, 
including but not limited to an aluminum adjuvant (e.g., aluminimi hydroxide); a cytokine adjuvant or 
adjuvant that stimulates a cytokine response (e.g., interleukin (IL)-12 and/or gamma-interferon 
cytokines); a Freund-type mineral oil adjuvant emulsion (e.g., Freund's complete or incomplete 
adjuvant); a synthetic lipoid compound; a copolymer adjuvant (e.g., TitreMax); a saponin; Quil A; a 
liposome; an oil-in-water emulsion (e.g., an emulsion stabilized by Tween 80 and pluronic 
polyoxyethlene/polyoxypropylene block copolymer (Syntex Adjuvant Formulation); TitreMax; 
detoxified endotoxin (MPL) and mycobacterial cell wall components (TDW, CWS) in 2% squalene (Ribi 
Adjuvant System)); a muramyl dipeptide; an immime-stimulating complex (ISCOM, e.g., an Ag- 
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modified saponin/cholesterol micelle that forms stable cage-like structure); an aqueous phase adjuvant 
that does not have a depot effect (e.g., Gerbu adjuvant); a carbohydrate polymer (e.g., AdjuPrime); L- 
tyrosine; a manide-oleate compoimd (e.g., Montanide); an ethylene-vinyl acetate copolymer (e.g., Elvax 
40W1 ,2); or lipid A, for example. Such compositions are useful for generating an immune response 
against osteoarthritis directed molecule (e.g., an HLA-binding subsequence within a polypeptide encoded 
by a PSMBl, TBP. PDCD2, ELP3. CHDCl or ERG nucleotide sequence). In such methods, a peptide 
having an ammo acid subsequence of a polypeptide encoded by a PSMBl, TBP, PDCD2, ELP3, CHDCl 
or ERG nucleotide sequence is delivered to a subject, where the subsequence binds to an HLA molecule 
and induces a CTL lymphocyte response. The peptide sometimes is delivered to the subject as an 
isolated peptide or as a minigene in a plasmid that encodes the peptide. Methods for identifying HLA- 
binding subsequences in such polypeptides are known (see e.g., publication WO02/20616 and PCX 
application US98/01373 for methods of identifying such sequences). 

[0109] The cell may be in a group of cells cultured in vitro or in a tissue maintained in vitro or 
present in an animal in vivo (e.g., a rat, mouse, ape or hunuin). In certain embodiments, a composition 
comprises a component from a cell such as a nucleic acid molecule (e.g., genomic DNA), a protein 
mixture or isolated protein, for example. The aforementioned compositions have utility in diagnostic, 
prognostic and pharmacogenomic methods described previously and in therapeutics described hereafter. 
Certain osteoarthritis directed molecules are described in greater detail below. 

Compoxmds 

[0110] Compounds can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; peptoid libraries (libraries of molecules having 
the functionalities of peptides, but with a novel, non-peptide backbone which are resistant to enzymatic 
degradation but which nevertheless remain bioactive (see, e.g., Zuckermann et al., J. Med. Chem.37: 
2678-85 (1994)); spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; "one-bead one-compound" library methods; and synthetic library 
methods using affinity chromatography selection. Biological library and peptoid Ubrary approaches are 
typically limited to peptide libraries, while the other approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compoimds (Lam, Anticancer Drug Des. 12: 145, (1997)). 
Examples of methods for synthesizing molecular libraries are described, for example, in DeWitt et al., 
Proc. Natl. Acad. Sci. U.S.A. 90: 6909 (1993); Erb et al., Proc. Natl. Acad. Sci. USA 91: 1 1422 (1994); 
Zuckermann et al., J. Med. Chem. 37: 2678 (1994); Cho et al., Science 261: 1303 (1993); Carrell et al., 
Angew. Chem. Int. Ed. Engl. 33: 2059 (1994); Carell et al., Angew. Chem. Int. Ed. Engl. 33: 2061 
(1994); and in Gallop et al., J. Med. Chem. 37: 1233 (1994). 
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[0111] Libraries of compounds may be presented in solution (e.g., Houghten, Biotechniques 13: 
412-421 (1992)), or on beads (Lam, Nature 354: 82-84 (1991)), chips (Fodor, Nature 364: 555-556 
(1993)), bacteria or spores (Ladner, United States Patent No. 5,223,409), plasmids (Cull et al., Proc. Natl. 
Acad. Sci. USA 89: 1865-1869 (1992)) or on phage (Scott and Smith, Science 249: 386-390 (1990); 
Devlin, Science 249: 404-406 (1990); Cwirla et al., Proc. Natl. Acad. Sci. 87: 6378-6382 (1990); Felici, 
J. Mol. Biol. 222: 301-310 (1991); Ladner supra.). 

[0112] A compound sometimes alters expression and sometimes alters activity of a polypeptide 
target and may be a small molecule. Small molecules include, but are not limited to, peptides, 
peptidomimetics (e.g., peptoids), amino acids, amino acid analogs, polynucleotides, polynucleotide 
analogs, nucleotides, nucleotide analogs, organic or inorganic compounds (i.e., including heteroorganic 
and organometallic compounds) having a molecular weight less than about 10,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 5,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 1,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 500 grams per mole, and 
salts, esters, and other pharmaceutically acceptable forms of such compounds. 

Antisense Nucleic Acid Molecules. Ribozvmes. RNAi. siRNA and Modifi ed Nucleic Acid 
Molecules 

[0113] An "antisense" nucleic acid refers to a nucleotide sequence complementary to a "sense" 
nucleic acid encoding a polypeptide, e.g., complementary to the coding strand of a double-stranded 
cDNA molecule or complementary to an mRNA sequence. The antisense nucleic acid can be 
complementeuy to an entire coding strand, or to a portion thereof or a substantially identical sequence 
thereof. In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" 
of the coding strand of a nucleotide sequence (e.g., 5' and 3' untranslated regions in SEQ ID NO: 1). 

[0114] An antisense nucleic acid can be designed such that it is complementary to the entire coding 
region of an mRNA encoded by a nucleotide sequence (e.g., SEQ ID NO: 1), and often the antisense 
nucleic acid is an oligonucleotide antisense to only a portion of a coding or noncoding region of the 
mRNA. For example, the antisense oligonucleotide can be complementary to the region surrounding the 
translation start site of the mRNA, e.g., between the -10 and +10 regions of the target gene nucleotide 
sequence of interest. An antisense oligonucleotide can be, for example, about 7, 10, 15, 20, 25, 30, 35, 
40, 45, 50, 55, 60, 65, 70, 75, 80, or more nucleotides in length. The antisense nucleic acids, which 
include the ribozymes described hereafter, can be designed to target a PSMBl, TBP, PDCD2, ELP3, 
CHDCl or ERG nucleotide sequence, often a variant associated with osteoarthritis, or a substantially 
identical sequence thereof. Among the variants, minor alleles and major alleles can be targeted, and 
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those associated with a higher risk of osteoarthritis are often designed, tested, and administered to 
subjects. 

[0115] An antisense nucleic acid can be constructed using chemical synthesis and enzymatic ligation 
reactions using standard procedures. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously 
modified nucleotides designed to increase the biological stability of the molecules or to increase the 
physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., 
phosphorothioate derivatives and acridine substituted nucleotides can be used. Antisense nucleic acid 
also can be produced biologically using an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interest, described further in the following subsection). 

[0116] When utilized as therapeutics, antisense nucleic acids typically are administered to a subject 
(e.g., by direct injection at a tissue site) or generated in situ such that they hybridize with or bind to 
cellular mRNA and/or genomic DNA encoding a polypeptide and thereby inhibit expression of the 
polypeptide, for example, by inhibiting transcription and/or translation. Alternatively, antisense nucleic 
acid molecules can be modified to target selected cells and then are administered systemically. For 
systemic administration, antisense molecules can be modified such that they specifically bind to receptors 
or antigens expressed on a selected cell surface, for example, by linking antisense nucleic acid molecules 
to peptides or antibodies which bind to cell surface receptors or antigens. Antisense nucleic acid 
molecules can also be delivered to cells using the vectors described herein. Sufficient intracellular 
concentrations of antisense molecules are achieved by incorporating a strong promoter, such as a pol II or 
pol III promoter, in the vector construct. 

[0117] Antisense nucleic acid molecules sometimes are alpha-anomeric nucleic acid molecules. An 
alpha-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA 
in which, contrary to the usual beta-units, the strands run parallel to each other (Gaultier et al.. Nucleic 
Acids. Res. 15: 6625-6641 (1987)). Antisense nucleic acid molecules can also comprise a 2'-o- 
methylribonucleotide (Inoue et al., Nucleic Acids Res. 15: 6131-6148 (1987)) or a chimeric RNA-DNA 
analogue (Inoue et al., FEBS Lett. 215: 327-330 (1987)). Antisense nucleic acids sometimes are 
composed of DNA or PNA or any other nucleic acid derivatives described previously. 

[0118] In another embodiment, an antisense nucleic acid is a ribozyme. A ribozyme having 
specificity for a PSMBl, TBP. PDCD2. ELP3, CHDCl or ERG nucleotide sequence can include one or 
more sequences complementary to such a nucleotide sequence, and a sequence having a icnown catalytic 
region responsible for noRNA cleavage (see e.g., U.S. Pat. No. 5,093,246 or Haselhoff and Gerlach, 
Nature 334: 585-591 (1 988)). For example, a derivative of a Tetrahymena L-1 9 IVS RNA is sometimes 
utilized in which the nucleotide sequence of the active site is complementary to the nucleotide sequence 
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to be cleaved in a mRNA (see e.g., Cech et al. U.S. Patent No. 4,987,071; and Cech et al. U.S. Patent No. 
5,116,742). Also, target mRNA sequences can be used to select a catalj^ic RNA having a specific 
ribonuclease activity from a pool of RNA molecules (see e.g., Bartel & Szostak, Science 261 : 1411-141 8 
(1993)). 

[0119] Osteoarthritis directed molecules include in certain embodiments nucleic acids that can form 
triple helix structures with a PSMBl. TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence, or a 
substantially identical sequence thereof, especially one that includes a regulatory region that controls 
expression of a polypeptide. Gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region of a nucleotide sequence referenced herein or a substantially 
identical sequence (e.g., promoter and/or enhancers) to form triple helical structures that prevent 
transcription of a gene in target cells (see e.g., Helene, Anticancer Drug Des. 6(6): 569-84 (1991); Helene 
et al.. Ann. N.Y. Acad. Sci. 660: 27-36 (1992); andMaher, Bioassays 14(12): 807-15 (1992). Potential 
sequences that can be targeted for triple helix formation can be increased by creating a so-called 
"switchback" nucleic acid molecule. Switchback molecules are synthesized in an alternating 5'-3', 3'-5' 
manner, such that they base pair with first one strand of a duplex and then the other, eliminating the 
necessity for a sizeable stretch of either purines or pyrimidines to be present on one strand of a duplex. 

[0120] Osteoarthritis directed molecules include RNAi and siRNA nucleic acids. Gene expression 
may be inhibited by the introduction of double-stranded RNA (dsRNA), which induces potent and 
specific gene silencing, a phenomenon called RNA interference or RNAi. See, e.g.. Fire et al., US Patent 
Number 6,506,559; Tuschl et al. PCT International Publication No. WO 01/75164; Kay et al. PCT 
International Publication No. WO 03/0101 80A1; or Bosher JM, Labouesse, Nat Cell Biol 2000 
Feb;2(2):E31-6. This process has been improved by decreasing the size of the double-stranded RNA to 
20-24 base pairs (to create small-interfering RNAs or siRNAs) that "switched off' genes in mammalian 
cells without initiating an acute phase response, i.e., a host defense mechanism that often results in cell 
death (see, e.g.. Caplen et al. Proc Natl Acad Sci USA. 2001 Aug 14;98(17):9742-7 and Elbashir et al. 
Methods 2002 Feb;26(2): 199-2 13). There is increasing evidence of post-transcriptional gene silencing 
by RNA interference (RNAi) for inhibiting targeted expression in mammalian cells at the mRNA level, 
in human cells. There is additional evidence of effective methods for inhibiting the proliferation and 
migration of timior cells in himian patients, and for inhibiting metastatic cancer development (see, e.g., 
U.S. Patent Application No. US20010009931 83; Caplen et al. Proc Natl Acad Sci U S A; and 
Abderrahmani et al. Mol Cell Biol 2001 Nov21(21):7256-67). 

[0121] An "siRNA" or "RNAi" refers to a nucleic acid that forms a double stranded RNA and has 
the ability to reduce or inhibit expression of a gene or target gene when the siRNA is delivered to or 
expressed in the same cell as the gene or target gene. "siRNA" refers to short double-stranded RNA 
formed by the complementary strands. Complementary portions of the siRNA that hybridize to form the 
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double stranded molecule often have substantial or complete identity to the target molecule sequence. In 
one embodiment, an siRNA refers to a nucleic acid that has substantial or complete identity to a target 
gene and forms a double stranded siRNA. 

[0122] When designing the siRNA molecules, the targeted region often is selected from a given 
DNA sequence beginning 50 to 100 nucleotides downstream of the start codon. See, e.g., Elbashir et al,. 
Methods 26:199-213 (2002). Initially, 5' or 3' UTRs and regions nearby the start codon were avoided 
assimiing that UTR-binding proteins and/or translation initiation complexes may interfere with binding of 
the siRNP or RISC endonuclease complex. Sometimes regions of the target 23 nucleotides in length 
conforming to the sequence motif AA(N19)TT (N, an nucleotide), and regions with approximately 30% 
to 70% G/C-content (often about 50% G/C-content) often are selected. If no suitable sequences are 
found, the search often is extended using the motif NA(N21). The sequence of the sense siRNA 
sometimes corresponds to (N19) TT or N21 (position 3 to 23 of the 23-nt motif), respectively. In the 
latter case, the 3' end of the sense siRNA often is converted to TT. The rationale for this sequence 
conversion is to generate a symmetric duplex with respect to the sequence composition of the sense and 
antisense 3' overhangs. The antisense siRNA is synthesized as the complement to position 1 to 21 of the 
23-nt motif. Because position 1 of the 23-nt motif is not recognized sequence-specifically by the 
antisense siRNA, the 3 '-most nucleotide residue of the antisense siRNA can be chosen deliberately. 
However, the penultimate nucleotide of the antisense siRNA (complementary to position 2 of the 23-nt 
motif) often is complementary to the targeted sequence. For simplifying chemical synthesis, TT often is 
utilized. siRNAs corresponding to the target motif NAR(N17)YNN, where R is purine (A,G) and Y is 
pyrimidine (C,U), often are selected. Respective 21 nucleotide sense and antisense siRNAs often begin 
with a purine nucleotide and can also be expressed fi-om pol IQ expression vectors without a change in 
targeting site. Expression of RNAs fixim pol DI promoters often is efficient when the first transcribed 
nucleotide is a purine. 

[0123] The sequence of the siRNA can correspond to the full length target gene, or a subsequence 
thereof. Often, the siRNA is about 15 to about 50 nucleotides in length (e.g., each complementary 
sequence of the double stranded siRNA is 15-50 nucleotides in length, and the double stranded siRNA is 
about 1 5-50 base pairs in length, sometimes about 20-30 nucleotides in length or about 20-25 nucleotides 
in length, e.g., 20, 21. 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in length. The siRNA sometimes 
is about 21 nucleotides in length. Methods of using siRNA are well known in the art, and specific siRNA 
molecules may be purchased fi-om a number of companies including Dharmacon Research, Inc. 

[0124] Antisense, ribozyme, RNAi and siRNA nucleic acids can be altered to form modified nucleic 
acid molecules. The nucleic acids can be altered at base moieties, sugar moieties or phosphate backbone 
moieties to improve stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of nucleic acid molecules can be modified to generate peptide nucleic acids (see 
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Hyrup et al., Bioorganic & Medicinal Chemistry 4 (1): 5-23 (1996)). As used herein, the terms "peptide 
nucleic acid" or "PNA" refers to a nucleic acid mimic such as a DNA mimic, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are 
retained. The neutral backbone of a PNA can allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. Synthesis of PNA oligomers can be performed using standard solid 
phase peptide synthesis protocols as described, for example, in Hyrup et al., (1996) supra and Perry- 
O'Keefe et al., Proc. Natl. Acad. Sci. 93: 14670-675 (1996). 

[0125] PNA nucleic acids can be used in prognostic, diagnostic, and therapeutic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, for example, inducing transcription or translation arrest or inhibiting replication. PNA 
nucleic acid molecules can also be used in the analysis of single base pair mutations in a gene, (e.g., by 
PNA-directed PCR clamping); as "artificial restriction enzymes" when used in combination with other 
enzymes, (e.g., SI nucleases (Hyrup (1996) supra)); or as probes or primers for DNA sequencing or 
hybridization (Hyrup et al., (1996) supra; Perry-O'Keefe supra). 

[0126] In other embodiments, oligonucleotides may include other appended groups such as peptides 
(e.g., for targeting host cell receptors in vivo), or agents facilitating transport across cell membranes (see 
e.g., Letsinger et al., Proc. Natl. Acad. Sci. USA 86: 6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. 
Sci. USA 84: 648-652 (1987); PCT Publication No. W088/O981O) or the blood-brain barrier (see, e.g., 
PCT Publication No. W089/10134). In addition, oligonucleotides can be modified with hybridization- 
triggered cleavage agents (See, e.g., Krol et al., Bio-Techniques 6: 958-976 (1988)) or intercalating 
agents. (See, e.g., Zon, Pharm. Res. 5: 539-549 (1988) ). To this end, the oligonucleotide may be 
conjugated to another molecule, (e.g., a peptide, hybridization triggered cross-linking agent, transport 
agent, or hybridization-triggered cleavage agent). 

[0127] Also included herein are molecular beacon oligonucleotide primer and probe molecules 
having one or more regions complementary to a PSMBl. TBP, PDCD2, ELP3, CHDCl or ERG 
nucleotide sequence, or a substantially identical sequence thereof, two complementary regions one 
having a fluorophore and one a quencher such that the molecular beacon is useful for quantifying the 
presence of the nucleic acid in a sample. Molecular beacon nucleic acids are described, for example, in 
Lizardi et al., U.S. Patent No. 5,854,033; Nazarenko et al., U.S. Patent No. 5,866,336, and Livak et al., 
U.S. Patent 5,876,930. 

Antibodies 

[0128] The term "antibody" as used herein refers to an immimoglobulin molecule or 
immunologically active portion thereof, i.e., an antigen-binding portion. Examples of immunologically 
active portions of immimoglobulin molecules include F(ab) and F(ab')2 fi:agments which can be 
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generated by treating the antibody with an enzyme such as pepsin. An antibody sometimes is a 
polyclonal, monoclonal, recombinant (e.g., a chimeric or humanized), fully human, non-human (e.g., 
murine), or a single chain antibody. An antibody may have effector function and can fix complement, 
and is sometimes coupled to a toxin or imaging agent. 

[0129] A full-length polypeptide or antigenic peptide fi-agment encoded by a nucleotide sequence 
referenced herein can be used as an immunogen or can be used to identify antibodies made with other 
immunogens, e.g., cells, membrane preparations, and the like. An antigenic peptide often includes at 
least 8 amino acid residues of the amino acid sequences encoded by a nucleotide sequence referenced 
herein, or substantially identical sequence thereof, and encompasses an epitope. Antigenic peptides 
sometimes include 10 or more amino acids, 15 or more amino acids, 20 or more amino acids, or 30 or 
more amino acids. Hydrophilic and hydrophobic fragments of polypeptides sometimes are used as 
immunogens. 

[0130] Epitopes encompassed by the antigenic peptide are regions located on the surface of the 
polypeptide (e.g., hydrophiUc regions) as well as regions with high antigenicity. For example, an Emini 
sxuface probability analysis of the human polypeptide sequence can be used to indicate the regions that 
have a particularly high probability of being localized to the surface of the polypeptide and are thus likely 
to constitute surface residues useful for targeting antibody production. The antibody may bind an epitope 
on any domain or region on polypeptides described herein. 

[0131] Also, chimeric, humanized, and completely human antibodies are useful for applications 
which include repeated administration to subjects. Chimeric and humanized monoclonal antibodies, 
conq)rising both human and non-himian portions, can be made using standard recombinant DNA 
techniques. Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in Robinson et al International 
Application No. PCT/US86/02269; AJdra, et al European Patent Application 1 84,1 87; Taniguchi, M., 
European Patent Apphcation 171,496; Morrison et al European Patent Application 173,494; Neuberger et 
al PCT International Publication No. WO 86/01533; Cabilly et al U.S. Patent No. 4,816,567; Cabilly et al 
European Patent Application 125,023; Better et al.. Science 240: 1041-1043 (1988); Liu et al., Proc. Natl. 
Acad. Sci. USA 84: 3439-3443 (1987); Liu et al., J. Immunol. 139: 3521-3526 (1987); Sun et al., Proc. 
Natl. Acad. Sci. USA 84: 214-218 (1987); Nishimura et al.. Cane. Res. 47: 999-1005 (1987); Wood et al., 
Nature 314: 446-449 (1985); and Shaw et al., J. Natl. Cancer Inst. 80: 1553-1559 (1988); Morrison, S. 
L., Science 229: 1202-1207 (1985); Oi et al., BioTechniques 4: 214 (1986); Winter U.S. Patent 
5,225,539; Jones et al.. Nature 321: 552-525 (1986); Verhoeyan et al.. Science 239: 1534; and Beidler et 
al., J. Immunol. 141: 4053-4060 (1988). 

[0132] Completely human antibodies are particularly desirable for therapeutic treatment of human 
patients. Such antibodies can be produced using transgenic mice that are incapable of expressing 
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endogenous immunoglobulin heavy and light chains genes, but which can express human heavy and light 
chain genes. See, for example, Lonberg and Huszar, Int. Rev. Immunol. 13: 65-93 (1995); and U.S. 
Patent Nos. 5,625,126; 5,633,425; 5,569,825; 5,661,016; and 5,545,806. In addition, companies such as 
Abgenix, Inc. (Fremont, CA) and Medarex, Inc. (Princeton, NJ), can be engaged to provide human 
antibodies directed against a selected antigen using technology similar to that described above. 
Completely human antibodies that recognize a selected epitope also can be generated using a technique 
referred to as "guided selection." In this approach a selected non-human monoclonal antibody (e.g., a 
murine antibody) is used to guide the selection of a completely human antibody recognizing the same 
epitope. This technology is described for example by Jespers et al., Bio/Technology 12: 899-903 (1994). 

[0133] An antibody can be a single chain antibody. A single chain antibody (scFV) can be 
engineered (see, e.g., Colcher et al., Ann. N Y Acad. Sci. 880: 263-80 (1999); and Reiter, Clin. Cancer 
Res. 2: 245-52 (1996)). Single chain antibodies can be dimerized or multimerized to generate 
multivalent antibodies having specificities for different epitopes of the same target polypeptide. 

[0134] Antibodies also may be selected or modified so that they exhibit reduced or no ability to bind 
an Fc receptor. For example, an antibody may be an isotype or subtype, fragment or other mutant, which 
does not support binding to an Fc receptor (e.g., it has a mutagenized or deleted Fc receptor binding 
region). 

[0135] Also, an antibody (or fragment thereof) may be conjugated to a therapeutic moiety such as a 
cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin or cytotoxic agent includes any 
agent that is detrimental to cells. Examples include taxol, cytochalasin B, gramicidin D, ethidium 
bromide, emetine, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, 
daunorubicin, dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1 
dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and 
analogs or homologs thereof. Therapeutic agents include, but are not limited to, antimetabolites (e.g., 
methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating agents 
(e.g., mechlorethamine, thiotepa chlorambucil, melphalan, carmustine (BCNU) and lomustine (CCNU), 
cyclophosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine 
platinum (U) (DDF) cisplatin), anthracyclines (e.g., daimorubicin (formerly daimomycin) and 
doxombicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, mithramycin, and 
anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and vinblastine). 

[0136] Antibody conjugates can be used for modifying a given biological response. For example, 
the drug moiety may be a protein or polypeptide possessing a desired biological activity. Such proteins 
may include, for example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a 
polypeptide such as tumor necrosis factor, gamma-interferon, alpha-interferon, nerve growth factor, 
platelet derived growth factor, tissue plasminogen activator; or, biological response modifiers such as, for 
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example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6 ("IL-6"), granulocyte 
macrophage colony stimulating factor ("GM-CSF"), granulocyte colony stimulating factor ("G-CSF"), or 
other growth factors. Also, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described by Segal in U.S. Patent No. 4,676,980, for example. 

[0137] An antibody (e.g., monoclonal antibody) can be used to isolate target polypeptides by 
standard techniques, such as affinity chromatography or immunoprecipitation. Moreover, an antibody 
can be used to detect a target polypeptide (e.g., in a cellular lysate or ceU supernatant) in order to evaluate 
the abundance and pattern of expression of the polypeptide. Antibodies can be used diagnostically to 
monitor polypeptide levels in tissue as part of a clinical testing procedure, e.g., to determine the efficacy 
of a given treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) the 
antibody to a detectable substance (i.e., antibody labeling). Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, B-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 
include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; 
examples of bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include '^'l, "'l, ^^S or ^H. Also, an antibody can be utilized as a test 
molecule for determining whether it can treat osteoarthritis, and as a therapeutic for administration to a 
subject for treating osteoarthritis. 

[0138] An antibody can be made by immunizing with a purified antigen, or a fragment thereof, e.g., 
a fragment described herein, a membrane associated antigen, tissues, e.g., crude tissue preparations, 
whole cells, preferably living cells, lysed cells, or cell fractions. 

[0139] Included herein are antibodies which bind only a native polypeptide, only denatured or 
otherwise non-native polypeptide, or which bind both, as well as those having linear or conformational 
epitopes. Conformational epitopes sometimes can be identified by selecting antibodies that bind to 
native but not denatured polypeptide. Also featured are antibodies that specifically bind to a polypeptide 
variant associated with osteoarthritis. 

Methods for Identifying Candidate Therapeutics for Treating Osteoarthritis 

[0140] Current therapies for the treatment of osteoarthritis have limited efiBcacy, limited tolerability 
and significant mechanism-based side effects, and few of the available therapies adequately address 
underlying defects. Current therapeutic approaches were largely developed in the absence of defined 
molecular targets or even a solid imderstanding of disease pathogenesis. Therefore, provided are 
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methods of identifying candidate therapeutics that target biochemical pathways related to the 
development of osteoarthritis. 

[0141] Thus, featured herein are methods for identifying a candidate therapeutic for treating 
osteoarthritis. The methods comprise contacting a test molecule with a target molecule in a system. A 
"target molecule" as used herein refers to a PSMBl. TBP, PDCD2, ELP3, CHDCl ox ERG nucleic acid, 
a substantially identical nucleic acid thereof, or a fragment thereof, and an encoded polypeptide of the 
foregoing. The methods also comprise determining the presence or absence of an interaction between the 
test molecule and the target molecule, where the presence of an interaction between the test molecule and 
the nucleic acid or polypeptide identifies the test molecule as a candidate osteoarthritis therapeutic. The 
interaction between the test molecule and the target molecule may be quantified. 

[0142] Test molecules and candidate therapeutics include, but are not limited to, compoimds, 
antisense nucleic acids, siRNA molecules, ribozymes, polypeptides or proteins encoded by a PSMBl, 
TBP, PDCD2, ELP3, CHDCl or ERG nucleotide sequence, or a substantially identical sequence or 
fragment thereof, and immunotherapeutics (e.g., antibodies and HLA-presented polypeptide fragments). 
A test molecule or candidate therapeutic may act as a modulator of target molecule concentration or 
target molecule function in a system. A "modulator" may agonize (i.e., up-regulates) or antagonize (i.e., 
down-regulates) a target molecule concentration partially or completely in a system by affecting such 
cellular fimctions as DNA replication and/or DNA processing (e.g., DNA methylation or DNA repair), 
RNA transcription and/or RNA processing (e.g., removal of intronic sequences and/or translocation of 
spliced mRNA from the nucleus), polypeptide production (e.g., translation of the polypeptide from 
mRNA), and/or polypeptide post-translational modification (e.g., glycosylation, phosphorylation, and 
proteolysis of pro-polypeptides). A modulator may also agonize or antagonize a biological fiinction of a 
target molecule partially or completely, where the function may include adopting a certain structural 
conformation, interacting with one or more binding partners, ligand binding, catalysis (e.g., 
phosphorylation, dephosphorylation, hydrolysis, methylation, and isomerization), and an effect upon a 
cellular event {e.g., effecting progression of osteoarthritis). 

[0143] As used herein, the term "system" refers to a cell free in vitro environment and a cell-based 
environment such as a collection of cells, a tissue, an organ, or an organism. A system is "contacted" 
with a test molecule in a variety of manners, including adding molecules in solution and allowing them to 
interact with one another by diffiision, cell injection, and any administration routes in an animal. As used 
herein, the term "interaction" refers to an effect of a test molecule on test molecule, where the effect 
sometimes is binding between the test molecule and the target molecule, and sometimes is an observable 
change in cells, tissue, or organism. 

[0144] There are many standeird methods for detecting the presence or absence of interaction 
between a test molecule and a target molecule. For example, titrametric, acidimetric, radiometric, NMR, 
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monolayer, polarographic, spectrophotometric, fluorescent, and ESR assays probative of a target 
molecule interaction may be utilized. Any modulator can be utilized in methods for detecting an 
interaction. For example, proteasome modulators (e.g., PSMBl includes a proteasome domain) are 
described in WO-20040 14882 and Roesel et al. Proceedings of the American Association of Cancer 
Research 2003, 44:1st Ed (Abs 1769), and bortezomib (Velcade, MLN-341, LDP-341 and PS-341), a 
ubiquitin proteosome inhibitor, is used for the treatment of multiple myeloma. 

[0145] Test molecule/target molecule interactions can be detected and/or quantified using assays 
known in the art. For example, an interaction can be determined by labeling the test molecule and/or the 
target molecule, where the label is covalently or non-covalently attached to the test molecule or target 
molecule. The label is sometimes a radioactive molecule such as '^^I, '^'l, ^^S or ^H, which can be 
detected by direct counting of radioemission or by scintillation counting. Also, enzymatic labels such as 
horseradish peroxidase, alkaline phosphatase, or luciferase may be utilized where the enzymatic label can 
be detected by determining conversion of an appropriate substrate to product. In addition, presence or 
absence of an interaction can be determined without labeling. For example, a microphysiometer {e.g., 
Cytosensor) is an analytical instrument that measvires the rate at which a cell acidifies its environment 
usmg a light-addressable potentiometric sensor (LAPS). Changes in this acidification rate can be used as 
an indication of an interaction between a test molecule and target molecule (McConnell, H. M. et al., 
Science 257: 1906-1912(1992)). 

[0146] In cell-based systems, cells typically include a PSMBl. TBP. PDCD2. ELP3. CHDCl or 
ERG nucleic acid, an encoded polypeptide, or substantially identical nucleic acid or polypeptide thereof, 
and are often of mammalian origin, although the cell can be of any origin. Whole cells, cell 
homogenates, and cell fractions {e.g., cell membrane fractions) can be subjected to analysis. Where 
interactions between a test molecule with a target polypeptide are monitored, soluble and/or membrane 
boxmd forms of the polypeptide may be utilized. Where membrane-boimd forms of the polypeptide are 
used, it may be desirable to utilize a solubilizing agent. Examples of such solubilizing agents include 
non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N- 
methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® X-1 14, Thesit®, 
Isotridecypoly(ethylene glycol ether)^, 3-[(3-cholamidopropyl)dimethylamminio]-l -propane sulfonate 
(CHAPS), 3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-l -propane sulfonate (CHAPSO), orN- 
dodecyl-N,N-dimethyl-3-ammonio-l -propane sulfonate. 

[0147] An interaction between a test molecule and target molecule also can be detected by 
monitoring fiuorescence energy transfer (FET) {see, e.g., Lakowicz et al, U.S. Patent No. 5,631,169; 
Stavrianopoulos et al. U.S. Patent No. 4,868,103). A fluorophore label on a first, "donor" molecule is 
selected such that its emitted fluorescent energy will be absorbed by a fluorescent label on a second. 
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"acceptor" molecule, which in turn is able to fluoresce due to the absorbed energy. Alternately, the 
"donor" polypeptide molecule may simply utilize the natural fluorescent energy of tryptophan residues. 
Labels are chosen that emit different wavelengths of light, such that the "acceptor" molecule label may 
be differentiated from that of the "donor". Since the efficiency of energy transfer between the labels is 
related to the distance separating the molecules, the spatial relationship between the molecules can be 
assessed. In a situation in which binding occurs between the molecules, the fluorescent emission of the 
"acceptor" molecule label in the assay should be maximal. An FET binding event can be conveniently 
measured through standard fluorometric detection means well known in the art (e.g. , using a fluorimeter). 

[0148] In another embodiment, determining the presence or absence of an interaction between a test 
molecule and a target molecule can be effected by monitoring surface plasmon resonance (see, e.g., 
Sjolander & Urbaniczk,y4«a/. Chem. 63: 2338-2345 (1991) and Szabo et al., Curr. Opin. Struct. Biol. 5: 
699-705 (1995)). "Siuface plasmon resonance" or "biomolecular interaction analysis (BIA)" can be 
utilized to detect biospecific interactions in real time, without labeling any of the interactants (e.g., 
BIAcore). Changes in the mass at the binding surface (indicative of a binding event) result in alterations 
of the refractive index of light near the surface (the optical phenomenon of surface plasmon resonance 
(SPR)), resulting in a detectable signal which can be used as an indication of real-time reactions between 
biological molecules. 

[0149] In another embodiment, the target molecule or test molecules are anchored to a solid phase, 
facilitating the detection of target molecule/test molecvile complexes and separation of the complexes 
from free, imcomplexed molecules. The target molecule or test molecule is immobilized to the solid 
support. In an embodiment, the target molecule is anchored to a solid surface, and the test molecule, 
which is not anchored, can be labeled, either directly or indirectly, with detectable labels discussed 
herein. 

[0150] It may be desirable to immobilize a target molecule, an anti-target molecxile antibody, and/or 
test molecules to facilitate separation of target molecule/test molecule complexes from uncomplexed 
forms, as well as to accommodate automation of the assay. The attachment between a test molecule 
and/or target molecule and the solid support may be covalent or non-covalent {see, e.g., U.S. Patent No. 
6,022,688 for non-covalent attachments). The solid support may be one or more surfaces of the system, 
such as one or more surfaces in each well of a microtiter plate, a surface of a silicon wafer, a surface of a 
bead {see, e.g.. Lam, Nature 354: 82-84 (1991)) that is optionally linked to another solid support, or a 
channel in a microfluidic device, for example. Types of solid supports, linker molecules for covalent and 
non-covalent attachments to solid supports, and methods for immobilizing nucleic acids and other 
molecules to solid supports are well known {see. e.g., U.S. Patent Nos. 6,261,776; 5,900,481; 6,133,436; 
and 6,022,688; and WIPO publication WO 01/18234). 
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[0151] In an embodiment, target molecule may be immobilized to surfaces via biotin and 
streptavidin. For example, biotinylated target polypeptide can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques known in the art (e.g. , biotinylation kit, Pierce Chemicals, Rockford, IL), 
and immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In another 
embodiment, a target polypeptide can be prepared as a fusion polypeptide. For example, glutathione-S- 
transferase/target polypeptide fusion can be adsorbed onto glutathione sepharose beads (Sigma Chemical, 
St. Louis, MO) or glutathione derivitized microliter plates, which are then combined with a test molecule 
under conditions conducive to complex formation (e.g., at physiological conditions for salt and pH). 
Following incubation, the beads or microtiter plate wells are washed to remove any unbound 
components, or the matrix is immobilized in the case of beads, and complex formation is determined 
directly or indirectly as described above. Alternatively, the complexes can be dissociated from the 
matrix, and the level of target molecule binding or activity is determined using standard techniques. 

[0152] In an embodiment, the non-immobilized component is added to the coated surface containing 
the anchored component. After the reaction is complete, unreacted components are removed (e.g., by 
washing) under conditions such that a significant percentage of complexes formed will remain 
immobilized to the solid surface. The detection of complexes anchored on the solid surface can be 
accomplished in a number of manners. Where the previously non-immobilized component is pre-labeled, 
the detection of label immobilized on the surface indicates that complexes were formed. Where the 
previously non-immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface, e.g., by adding a labeled antibody specific for the immobilized 
component, where the antibody, in turn, can be directly labeled or indirectly labeled with, e.g., a labeled 
anti-Ig antibody. 

[0153] In another embodiment, an assay is performed utilizing emtibodies that specifically bind 
target molecule or test molecule but do not interfere with binding of the target molecule to the test 
molecule. Such antibodies can be derivitized to a solid support, and unbound target molecule may be 
immobilized by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of complexes using 
antibodies reactive with the target molecule, as well as enzyme-linked assays which rely on detecting an 
enzymatic activity associated with the target molecule. 

[0154] Cell free assays also can be conducted in a liquid phase. In such an assay, reaction products 
are separated from imreacted components, by any of a number of standard techniques, including but not 
limited to: differential centrifiigation (see. e.g., Rivas, G., and Minton, Trends Biochem Sci Aug; 18(8): 
284-7 (1993)); chromatography (gel filtration chromatography, ion-exchange chromatography); 
electrophoresis {see, e.g., Ausubel et al., eds. Current Protocols in Molecular Biology , J. Wiley: New 
York (1999)); and unmimoprecipitation (see, e.g., Ausubel et al., eds., supra). Media and 
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chromatographic techniques are known to one skilled in the art {see, e.g., Heegaard, J Mol. Recognit. 
Winter; 11(1-6): 141-8 (1998); Hage & Tweed, J. Chromatogr. B Biomed. Sci. Appl. Oct 10; 699 (1-2): 
499-525 (1997)). Further, fluorescence energy transfer may also be conveniently utilized, as described 
herein, to detect binding without further purification of the complex fi-om solution. 

[0155] In another embodiment, modulators of target molecule expression are identified. For 
example, a cell or cell free mixture is contacted with a candidate compound and the expression of target 
mRNA or target polypeptide is evaluated relative to the level of expression of target mRNA or target 
polypeptide in the absence of the candidate compound. When expression of target mRNA or target 
polypeptide is greater in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an agonist of target mRNA or target polypeptide expression. Alternatively, 
when expression of target mRNA or target polypeptide is less (e.g., less with statistical significance) in 
the presence of the candidate compound than in its absence, the candidate compound is identified as an 
antagonist or inhibitor of target mRNA or target polypeptide expression. The level of target mRNA or 
target polypeptide expression can be determined by methods described herein. 

[0156] In another embodiment, binding partners that interact with a target molecule are detected. 
The target molecules can interact with one or more celluleir or extracellular macromolecules, such as 
polypeptides in v/vo, and these interacting molecules are referred to herein as "binding partners." 
Binding partners can agonize or antagonize target molecule biological activity. Also, test molecules that 
agonize or antagonize interactions between target molecules and binding partners can be useful as 
therapeutic molecules as they can up-regulate or down-regulated target molecule activity in vivo and 
thereby treat osteoarthritis. 

[0157] Binding partners of target molecules can be identified by methods known in the art. For 
example, binding partners may be identified by lysing cells and analyzing cell lysates by electrophoretic 
techniques. Alternatively, a two-hybrid assay or three-hybrid assay can be utilized {see, e.g., U.S. Patent 
No. 5,283,317; Zervos et al., Cell 72:223-232 (1993); Madura et al. J. Biol. Chem. 268: 12046-12054 
(1993); Bartel etal, Biotechniques 14: 920-924 (1993); Iwabuchi etal. Oncogene 8: 1693-1696 (1993); 
and Brent WO94/10300). A two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. The assay often utilizes two 
different DNA constructs. In one construct, a PSMBl, TBP, PDCD2, ELP3, CHDCl or ERG nucleic 
acid (sometimes referred to as the "bait") is fused to a gene encoding the DNA binding domain of a 
known transcription factor (e.g., GAL-4). In another construct, a DNA sequence from a library of DNA 
sequences that encodes a potential binding partner (sometimes referred to as the "prey") is fused to a 
gene that encodes an activation domain of the known transcription factor. Sometimes, a PSMBl, TBP, 
PDCD2, ELP3, CHDCl or ERG nucleic acid can be fused to the activation domain. If the "bait" and the 
"prey" molectiles interact in vivo, the DNA-binding and activation domains of the transcription factor are 
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brought into close proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) which is 
operably linked to a transcriptional regulatory site responsive to the transcription factor. Expression of 
the reporter gene can be detected and cell colonies containing the functional transcription factor can be 
isolated and used to identify the potential binding partner. 

[0158] In an embodiment for identifying test molecules that antagonize or agonize complex 
formation between target molecules and binding partners, a reaction mixture containing the target 
molecule and the binding partner is prepared, under conditions and for a time sufficient to allow complex 
formation. The reaction mixture often is provided in the presence or absence of the test molecule. The 
test molecule can be included initially in the reaction mixtiire, or can be added at a time subsequent to the 
addition of the target molecule and its binding partner. Control reaction mixtures are incubated without 
the test molecule or with a placebo. Formation of any complexes between the target molecule and the 
binding partner then is detected. Decreased formation of a complex in the reaction mixture containing 
test molecule as compared to in a control reaction mixture indicates that the molecule antagonizes target 
molecule/binding partner complex formation. Alternatively, increased formation of a complex in the 
reaction mixture containing test molecule as compared to in a control reaction mixture indicates that the 
molecule agonizes target molecule/binding partner complex formation. In another embodiment, complex 
formation of target molecule/binding partner can be compared to complex formation of mutant target 
molecule/binding partner (e.g., amino acid modifications in a target polypeptide). Such a comparison can 
be important in those cases where it is desirable to identify test molecules that modulate interactions of 
mutant but not non-mutated target gene products. 

[0159] The assays can be conducted in a heterogeneous or homogeneous format. In heterogeneous 
assays, target molecule and/or the binding partner are immobilized to a solid phase, and complexes are 
detected on the solid phase at the end of the reaction. In homogeneous assays, the entire reaction is 
carried out in a liquid phase. In either approach, the order of addition of reactants can be varied to obtain 
different information about the molecvdes being tested. For example, test compounds that agonize target 
molecule/binding partner interactions can be identified by conducting the reaction in the presence of the 
test molecule in a competition format. Alternatively, test molecules that agonize preformed complexes, 
e.g., molecules with higher binding constants that displace one of the components from the complex, can 
be tested by adding the test compound to the reaction mixture after complexes have been formed. 

[0160] In a heterogeneous assay embodiment, the target molecule or the binding partner is anchored 
onto a solid surface (e.g., a microtiter plate), while the non-anchored species is labeled, either directly or 
indirectly. The anchored molecule can be immobilized by non-covalent or covalent attachments. 
Alternatively, an immobilized antibody specific for the molecule to be anchored can be used to anchor 
the molecxale to the solid surface. The partner of the immobilized species is exposed to the coated surface 
with or without the test molecule. After the reaction is complete, unreacted components are removed 
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(e.g., by washing) such that a significant portion of any complexes formed will remain immobilized on 
the solid surface. Where the non-immobilized species is pre-labeled, the detection of label immobilized 
on the surface is indicative of complex. Where the non-immobilized species is not pre-labeled, an 
indirect label can be used to detect complexes anchored to the surface; e.g. , by using a labeled antibody 
specific for the initially non-immobilized species. Depending upon the order of addition of reaction 
components, test compounds that inhibit complex formation or that disrupt preformed complexes can be 
detected. 

[0161] hi another embodiment, the reaction can be conducted in a liquid phase in the presence or 
absence of test molecule, where the reaction products are separated from unreacted components, and the 
complexes are detected {e.g., using an immobilized antibody specific for one of the binding components 
to anchor any complexes formed in solution, and a labeled antibody specific for the other partner to 
detect anchored complexes). Again, depending upon the order of addition of reactants to the liquid 
phase, test compounds that inhibit complex or that disrupt preformed complexes can be identified. 

[0162] In an alternate embodiment, a homogeneous assay can be utilized. For example, a preformed 
complex of the target gene product and the interactive cellular or extracellular binding partner product is 
prepared. One or both of the target molecule or binding partner is labeled, and the signal generated by 
the label(s) is quenched upon complex formation (e.g., U.S. Patent No. 4,109,496 that utilizes this 
approach for immimoassays). Addition of a test molecxxle that competes with and displaces one of the 
species from the preformed complex will result in the generation of a signal above background, hi this 
way, test substances that disrupt target molecule/binding partner complexes can be identified. 

[0163] Candidate therapeutics for treating osteoarthritis are identified from a group of test molecules 
that interact with a target molecule. Test molecules are normally ranked according to the degree with 
which they modulate {e.g., agonize or antagonize) a fimction associated with the target molecule {e.g., 
DNA replication and/or processing, RNA transcription and/or processing, polypeptide production and/or 
processing, and/or biological function/activity), and then top ranking modulators are selected. Also, 
pharmacogenomic information described herein can determine the rank of a modulator. The top 10% of 
ranked test molecules often are selected for frirther testing as candidate therapeutics, and sometimes the 
top 15%, 20%, or 25% of ranked test molecules are selected for further testing as candidate therapeutics. 
Candidate therapeutics typically are formulated for administration to a subject. 

Therapeutic Formulations 

[0164] Formulations and pharmaceutical compositions typically include in combination with a 
pharmaceutically acceptable carrier one or more target molecule modulators. The modulator often is a 
test molecule identified as having an interaction with a target molecule by a screening method described 
above. The modulator may be a compound, an antisense nucleic acid, a ribozyme, an antibody, or a 
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binding partner. Also, formulations may comprise a target polypeptide or fragment thereof in 
combination with a pharmaceutically acceptable carrier. 

[0165] As used herein, the term "pharmaceutically acceptable carrier" includes solvents, dispersion 
media, coatings, antibacterial and antifiingal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. Supplementary active compounds can also be 
incorporated into the compositions. Pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

[0166] A pharmaceutical composition typically is formulated to be compatible with its intended 
route of administration. Examples of routes of administration include parenteral, e.g., intravenous, 
intradermal, subcutaneous, oral (e.g. , inhalation), transdermal (topical), transmucosal, and rectal 
administration. Solutions or suspensions used for parenteral, intradermal, or subcutaneous application 
can include the following components: a sterile diluent such as water for injection, saline solution, fixed 
oils, polyethylene glycols, glycerin, propylene glycol or other synthetic solvents; antibacterial agents 
such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; 
chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with 
acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic. 

[0167] Oral compositions generally include an inert diluent or an edible carrier. For the purpose of 
oral therapeutic administration, the active compoimd can be incorporated with excipients and used in (he 
form of tablets, troches, or capsules, e.g., gelatin capsules. Oral compositions can also be prepared using 
a fluid carrier for use as a mouthwash. Pharmaceutically compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, troches and the like can 
contain any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 
disintegrating agent such as alginic acid, Primogel, or com starch; a lubricant such as magnesium stearate 
or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; 
or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 

[0168] Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological 
saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline 
(PBS). In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It should be stable imder the conditions of manufacture and storage and must be 
preserved against the contaminating action of microorganisms such as bacteria and fimgi. The carrier can 
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be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of 
the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action 
of microorganisms can be achieved by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to 
include isotonic agents, for example, sugars, polyalcohols such as mannitol, sorbitol, sodium chloride in 
the composition. Prolonged absorption of the injectable compositions can be brought about by including 
in the composition an agent which delays absorption, for example, aluminum monostearate and gelatin. 

[0169] Sterile injectable solutions can be prepared by incorporating the active compound in the 
required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the 
active compound into a sterile vehicle which contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of preparation are vacuum drying and freeze-drying which 
yields a powder of the active ingredient plus any additional desired ingredient from a previously sterile- 
filtered solution thereof. 

[0170] For administration by inhalation, the compoimds are delivered in the form of an aerosol 
spray from pressured container or dispenser which contains a suitable propellant, e.g., a gas such as 
carbon dioxide, or a nebulizer. 

[0171] Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are 
used in the formulation. Such penetrants are generally known in the art, and include, for example, for 
transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal 
administration can be accomplished through the use of nasal sprays or suppositories. For transdermal 
administration, the active compounds are formulated into ointments, salves, gels, or creams as generally 
known in the art. Molecules can also be prepared in the form of suppositories (e.g., with conventional 
suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal delivery. 

[0172] In one embodiment, active molecules are prepared with carriers that will protect the 
compoimd against rapid elimination from the body, such as a controlled release formulation, including 
implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, 
such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and 
polylactic acid. Methods for preparation of such formulations will be apparent to those skilled in the art. 
Materials can also be obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. 
Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
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viral antigens) can aiso be used as pharmaceuticcilly acceptable carriers. These can be prepared 
according to methods known to those skilled in the art, for example, as described in U.S. Patent 
No. 4,522,811. 

[0173] It is advantageous to formulate oral or parenteral compositions in dosage imit form for ease 
of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete 
units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity 
of active compoimd calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. 

[0174] Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD50 (the 
dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of the 
population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
expressed as the ratio LD50/ED50. Molecules which exhibit high therapeutic indices are preferred. While 
molecules that exhibit toxic side effects may be used, care should be taken to design a delivery system 
that targets such compounds to the site of affected tissue in order to minimize potential damage to 
uninfected cells and, thereby, reduce side effects. 

[0175] The data obtained from the cell culture assays and animal studies can be used in formulating 
a range of dosage for use in humans. The dosage of such molecules lies preferably within a range of 
circulating concentrations that include the ED50 with little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and the route of administration utilized. For any 
molecules used in the methods described herein, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose may be formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC50 (i.e., the concentration of the test compoimd which 
achieves a half-maximal inhibition of syn^toms) as determined in cell culture. Such information can be 
used to more accurately determine useful doses in himians. Levels in plasma may be measured, for 
example, by high performance liquid chromatography. 

[0176] As defined herein, a therapeutically effective amount of protein or polypeptide (i.e., an 
effective dosage) ranges from about 0.001 to 30 mg/kg body weight, sometimes about 0.01 to 25 mg/kg 
body weight, often about 0.1 to 20 mg/kg body weight, and more often about 1 to 10 mg/kg, 2 to 9 
mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 5 to 6 mg/kg body weight. The protein or polypeptide can be 
administered one time per week for between about 1 to 10 weeks, sometimes between 2 to 8 weeks, often 
between about 3 to 7 weeks, and more often for about 4, 5, or 6 weeks. The skilled artisan will 
appreciate that certain factors may influence the dosage and timing required to effectively treat a subject, 
including but not limited to the severity of the disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. Moreover, treatment of a subject with a 
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therapeutically effective amount of a protein, polypeptide, or antibody can include a single treatment or, 
preferably, can include a series of treatments. 

[0177] With regard to polypeptide formulations, feat\ired herein is a method for treating 
osteoarthritis in a subject, which comprises contacting one or more cells in the subject with a first 
polypeptide, where the subject comprises a second polypeptide having one or more polymorphic 
variations associated with cancer, and where the first polypeptide comprises fewer polymorphic 
variations associated with cancer than the second polypeptide. The first and second polypeptides are 
encoded by a nucleic acid which comprises a nucleotide sequence in SEQ ED NO: 1 -1 2; a nucleotide 
sequence which encodes a polypeptide consisting of an amino acid sequence encoded by a nucleotide 
sequence referenced in SEQ ID NO: 1-12; a nucleotide sequence which encodes a polypeptide that is 
90% or more identical to an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1-12 
and a nucleotide sequence 90% or more identical to a nucleotide sequence in SEQ ID NO: 1-12. The 
subject often is a human. 

[0178] For antibodies, a dosage of 0.1 mg/kg of body weight (generally 1 0 mg/kg to 20 mg/kg) is 
often utilized. If the antibody is to act in the brain, a dosage of 50 mg/kg to 100 mg/kg is often 
appropriate. Generally, partially human antibodies and fully human antibodies have a longer half-life 
within the human body than other antibodies. Accordingly, lower dosages and less frequent 
administration is often possible. Modifications such as lipidation can be used to stabilize antibodies and 
to enhance uptake and tissue penetration (e.g., into the brain). A method for lipidation of antibodies is 
described by Cruikshank et al., J. Acquired Immune Deficiency Syndromes and Human Retrovirology 
14:193 (1997). 

[0179] Antibody conjugates can be used for modifying a given biological response, the drug moiety 
is not to be construed as limited to classical chemical therapeutic agents. For example, the drug moiety 
may be a protein or polypeptide possessing a desired biological activity. Such proteins may include, for 
example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a polypeptide such as 
tumor necrosis factor, alpha-interferon, beta-interferon, nerve growth factor, platelet derived growth 
factor, tissue plasminogen activator; or, biological response modifiers such as, for example, lymphokines, 
interleukin-1 ("IL-l"), interleukin-2 ("IL-2"), interleukin-6 ("IL-6"), granulocyte macrophage colony 
stimulating factor ("GM-CSF"), granulocyte colony stimulating factor ("G-CSF"), or other growth 
factors. Alternatively, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

[0180] For compounds, exemplary doses include milligram or microgram amovints of the compound 
per kilogram of subject or sample weight, for example, about 1 microgram per kilogram to about 500 
milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams per kilogram, or 
about 1 microgram per kilogram to about 50 micrograms per kilogram. It is understood that appropriate 
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doses of a small molecule depend upon the potency of the small molecule with respect to the expression 
or activity to be modulated. When one or more of these small molecules is to be administered to an 
animal (e.g., a human) in order to modulate expression or activity of a polypeptide or nucleic acid 
described herein, a physician, veterinarian, or researcher may, for example, prescribe a relatively low 
dose at first, subsequently increasing the dose imtil an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend upon a variety of 
factors including the activity of the specific compound employed, the age, body weight, general health, 
gender, and diet of the subject, the time of administration, the route of administration, the rate of 
excretion, any drug combination, and the degree of expression or activity to be modulated. 

[0181] With regard to nucleic acid formulations, gene therapy vectors can be delivered to a subject 
by, for example, intravenous injection, local administration (see, e.g., U.S. Patent 5,328,470) or by 
stereotactic injection (see e.g., Chen et ai, (1994) Proc. Natl. Acad. Sci. USA 97:3054-3057). 
Pharmaceutical preparations of gene therapy vectors can include a gene therapy vector in an acceptable 
diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector can be produced intact fi-om recombinant cells 
(e.g., retroviral vectors) the pharmaceutical preparation can include one or more cells which produce the 
gene dehvery system. Examples of gene delivery vectors are described herein. 

Therapeutic Methods 

[0182] A therapeutic formulation described above can be administered to a subject in need of a 
therapeutic for inducing a desired biological response. Therapeutic formulations can be administered by 
any of the paths described herein. With regard to both prophylactic and therapeutic methods of 
treatment, such treatments may be specifically tailored or modified, based on knowledge obtained from 
pharmacogenomic analyses described herein. 

[0183] As used herein, the term "treatment" is defined as the application or administration of a 
therapeutic formulation to a subject, or application or administration of a therapeutic agent to an isolated 
tissue or cell line fi-om a subject with the purpose to cure, heal, alleviate, relieve, alter, remedy, 
ameliorate, improve or affect osteoarthritis, sjmiptoms of osteoarthritis or a predisposition towards 
osteoarthritis. A therapeutic formulation includes, but is not limited to, small molecules, peptides, 
antibodies, ribozymes and antisense oligonucleotides. Administration of a therapeutic formulation can 
occur prior to the manifestation of symptoms characteristic of osteoarthritis, such that osteoarthritis is 
prevented or delayed in its progression. The appropriate therapeutic composition can be determined 
based on screening assays described herein. 

[0184] As discussed, successfiil treatment of osteoarthritis can be brought about by techniques that 
serve to agonize target molecule expression or fimction, or alternatively, antagonize target molecule 
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expression or function. These techniques include administration of modulators that include, but are not 
limited to, small organic or inorganic molecules; antibodies (including, for example, polyclonal, 
monoclonal, humanized, anti-idiotypic, chimeric or single chain antibodies, and Fab, F(ab')2 and Fab 
expression library fragments, scFV molecules, and epitope-binding fragments thereof); and peptides, 
phosphopeptides, or polypeptides. 

[0185] Further, antisense and ribozyme molecules that inhibit expression of the target gene can also 
be used to reduce the level of target gene expression, thus effectively reducing the level of target gene 
activity. Still ftirther, triple helix molecules can be utilized in reducing the level of target gene activity. 
Antisense, ribozyme and triple helix molecules are discussed above. It is possible that the use of 
antisense, ribozyme, and/or triple helix molecules to reduce or inhibit mutant gene expression can also 
reduce or inhibit the transcription (triple helix) and/or translation (antisense, ribozyme) of mRNA 
produced by normal target gene alleles, such that the concentration of normal target gene product present 
can be lower than is necessary for a normal phenotype. In such cases, nucleic acid molecules that encode 
and express target gene polypeptides exhibiting normal target gene activity can be introduced into cells 
via gene therapy method. Alternatively, in instances in that the target gene encodes an extracellular 
polypeptide, it can be preferable to co-administer normal target gene polypeptide into the cell or tissue in 
order to maintain the requisite level of cellular or tissue target gene activity. 

[01861 Another method by which nucleic acid molecules may be utilized in treating or preventing 
osteoarthritis is use of aptamer molecules specific for target molecules. Aptamers are nucleic acid 
molecules having a tertiary structure which permits them to specifically bind to ligands {see, e.g., 
Osborne, etal. Curr. Opin. Chem. Biol. I (J): 5-9 (1997); andPatel, D. J., Curr. Opin. Chem. Biol. 
Jun;l(l): 32-46 (1997)). 

[0187] Yet another method of utilizing nucleic acid molecules for osteoarthritis treatment is gene 
therapy, which can also be referred to as allele therapy. Provided herein is a gene therapy method for 
treating osteoarthritis in a subject, which comprises contacting one or more cells in the subject or from 
the subject with a nucleic acid having a first nucleotide sequence (e.g., the first nucleotide sequence is 
identical to or substantially identical to a nucleotide sequence of SEQ ID NO: 1-12). Genomic DNA in 
the subject comprises a second nucleotide sequence having one or more polymorphic variations 
associated with osteoarthritis (e.g., the second nucleotide sequence is identical to or substantially 
identical to a nucleotide sequence of SEQ ID NO: 1-4). The first and second nucleotide sequences 
typically are substantially identical to one another, and the first nucleotide sequence comprises fewer 
polymorphic variations associated with osteoarthritis than the second nucleotide sequence. The first 
nucleotide sequence may comprise a gene sequence that encodes a fiiU-length polypeptide or a fragment 
thereof. The subject is often a human. Allele therapy methods often are utilized in conjunction with a 
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method of first determining whether a subject has genomic DNA that includes polymorphic variants 
associated with osteoarthritis. 

[0188] In another allele therapy embodiment, provided herein is a method which comprises 
contacting one or more cells in the subject or fi-om the subject with a polypeptide encoded by a nucleic 
acid having a first nucleotide sequence (e.g., the first nucleotide sequence is identical to or substantially 
identical to the nucleotide sequence of SEQ ID NO: 1-12). Genomic DNA in the subject comprises a 
second nucleotide sequence having one or more polymorphic variations associated with osteoarthritis 
(e.g., the second nucleotide sequence is identical to or substantially identical to a nucleotide sequence of 
SEQ ID NO: 1-4). The first and second nucleotide sequences typically are substantially identical to one 
another, and the first nucleotide sequence comprises fewer polymorphic variations associated with 
osteoarthritis than the second nucleotide sequence. The first nucleotide sequence may comprise a gene 
sequence that encodes a full-length polypeptide or a fragment thereof. The subject is often a human. 

[01891 For antibody-based therapies, antibodies can be generated that are both specific for target 
molecules and that reduce target molecule activity. Such antibodies may be administered in instances 
where antagonizing a target molecule function is appropriate for the treatment of osteoarthritis. 

[0190] In circumstances where stimulating antibody production in an animal or a human subject by 
injection with a target molecule is harmfiil to the subject, it is possible to generate an immime response 
against the target molecule by use of anti-idiotypic antibodies {see, e.g., Herlyn, Ann. Med.;31(l): 66-78 
(1999); and Bhattacharya-Chatterjee & Foon, Cancer Treat. Res.; 94: 51-68 (1998)). Introducing an 
anti-idiotypic antibody to a mammal or human subject often stimulates production of anti-anti-idiotypic 
antibodies, which typically are specific to the target molecule. Vaccines directed to osteoarthritis also 
may be generated in this fashion. 

[0191] In instances where the target molecule is intracellular and whole antibodies are used, 
internalizing antibodies may be preferred. Lipofectin or liposomes can be used to deliver the antibody or 
a Segment of the Fab region that binds to the target antigen into cells. Where fragments of the antibody 
are used, the smallest inhibitory fragment that binds to the target antigen is preferred. For example, 
peptides having an amino acid sequence corresponding to the Fv region of the antibody can be used. 
Alternatively, single chain neutralizing antibodies that bind to intracellular target antigens can also be 
administered. Such single chain antibodies can be administered, for example, by expressing nucleotide 
sequences encoding single-chain antibodies within the target cell population {see, e.g., Marasco et al., 
Proc. Natl. Acad. Sci. USA 90: 7889-7893 (1993)). 

[0192] Modulators can be administered to a patient at therapeutically effective doses to treat 
osteoarthritis. A therapeutically effective dose refers to an amount of the modulator sufficient to result in 
amelioration of symptoms of osteoarthritis. Toxicity and therapeutic efficacy of modulators can be 
determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
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determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically 
effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio LD50/ED50. Modulators that exhibit large 
therapeutic indices are preferred. While modulators that exhibit toxic side effects can be used, care 
should be taken to design a delivery system that targets such molecules to the site of affected tissue in 
order to minimize potential damage to uninfected cells, thereby reducing side effects. 

[0193] Data obtained from cell culture assays and animal studies can be used in formulating a range 
of dosages for use in humans. The dosage of such compounds lies preferably within a range of 
circulating concentrations that include the ED50 with little or no toxicity. The dosage can vary within this 
range depending upon the dosage form employed and the route of administration utilized. For any 
compound used in the methods described herein, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose can be formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC50 (i.e., the concentration of the test compound that 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be 
used to more accurately determine useful doses in humans. Levels in plasma can be measured, for 
example, by high performance liquid chromatography. 

[0194] Another example of effective dose determination for an individual is the ability to directly 
assay levels of "free" and "bound" compound in the serum of the test subject. Such assays may utilize 
antibody mimics and/or "biosensors" that have been created through molecular imprinting techniques. 
Molecules that modulate target molecule activity are used as a template, or "imprinting molecule", to 
spatially organize polymerizable monomers prior to their polymerization with catalytic reagents. The 
subsequent removal of the imprinted molecule leaves a poljmier matrix which contains a repeated 
"negative image" of the compound and is able to selectively rebind the molecule under biological assay 
conditions. A detailed review of this technique can be seen in Ansell et al., Current Opinion in 
Biotechnology 7: 89-94 (1996) and in Shea, Trends in Polymer Science 2 : 166-173 (1994). Such 
"imprinted" affinity matrixes are amenable to ligand-binding assays, whereby the immobilized 
monoclonal antibody component is replaced by an appropriately imprinted matrix. An example of the 
use of such matrixes in this way can be seen in Vlatakis, et al., Nature 361: 645-647 (1 993). Through the 
use of isotope-labeling, the "free" concentration of compound which modulates target molecule 
expression or activity readily can be monitored and used in calculations of IC50. Such "imprinted" 
affinity matrixes can also be designed to include fluorescent groups whose photon-emitting properties 
measurably change upon local and selective binding of target compoimd. These changes readily can be 
assayed in real time using appropriate fiberoptic devices, in turn allowing the dose in a test subject to be 
qxiickly optimized based on its individual ICso. An example of such a "biosensor" is discussed in Kriz et 
al. Analytical Chemistry 67: 2142-2144 (1995). 
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[0195] The examples set forth below are intended to illustrate but not limit the invention. 
Examples 

[0196] In the following studies a group of subjects was selected according to specific parameters 
relating to osteoarthritis. Nucleic acid samples obtained from individuals in the study group were 
subjected to genetic analysis, which identified associations between osteoarthritis and polymorphisms in 
the following genes or regions: chromosome 6 (6q27), ELP3, CHDCl, and ERG (herein referred to as 
"targets"). The polymorphisms were genotyped again in two replication cohorts consisting of individuals 
selected for OA. In addition, SNPs proximal to the incident polymorphisms were identified and 
allelotyped in OA case and control pools. Methods are described for producing PSMBl, TBP, PDCD2, 
ELP3, CHDCl, and ERG polypeptide and polypeptide variants thereof m vitro or in vivo; PSMBl, TBP, 
PDCD2, ELP3. CHDCl, and ERG nucleic acids or polypeptides and variants thereof are utilized for 
screening test molecules for those that interact with PSMBl, TBP, PDCD2, ELP3, CHDCl, and ERG 
molecules. Test molecules identified as interactors with PSMBl, TBP, PDCD2, ELP3, CHDCl, and 
ERG molecules and variants thereof are further screened in vivo to determine whether they treat 
osteoarthritis. 

Example 1 
Samples and Pooling Strategies 

Sample Selection 

[0197] Blood samples were collected from individuals diagnosed with knee osteoarthritis, which 
were referred to as case samples. Also, blood samples were collected from individuals not diagnosed 
with knee osteoarthritis as gender and age-matched controls. A database was created that listed all 
phenotypic trait information gathered from individuals for each case and control sample. Genomic DNA 
was extracted from each of the blood samples for genetic analyses. 

DNA Extraction from Blood Samples 

[0198] Six to ten milliliters of whole blood was transferred to a 50 ml tube containing 27 ml of red 
cell lysis solution (RCL). The tube was inverted until the contents were mixed. Each tube was incubated 
for 10 minutes at room temperature and inverted once during the incubation. The tubes were then 
centrifliged for 20 minutes at 3000 x g and the supematant was carefrilly poured off. 100-200 ^1 of 
residual liquid was left in the tube and was pipetted repeatedly to resuspend the pellet in the residual 
supematant. White cell lysis solution (WCL) was added to the tube and pipetted repeatedly until 
completely mixed. While no incubation was normally required, the solution was incubated at 37°C or 
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room temperature if cell clumps were visible after mixing until the solution was homogeneous. 2 ml of 
protein precipitation was added to the cell lysate. The mixt\ires were vortexed vigorously at high speed 
for 20 sec to mix the protein precipitation solution uniformly with the cell lysate, and then centrifuged for 
10 minutes at 3000 x g. The supernatant containing the DNA was then poured into a clean 15 ml tube, 
which contained 7 ml of 100% isopropanol. The samples were mixed by inverting the tubes gently until 
white threads of DNA were visible. Samples were centrifiiged for 3 minutes at 2000 x g and the DNA 
was visible as a small white pellet. The supernatant was decanted and 5 ml of 70% ethanol was added to 
each tube. Each tube was inverted several times to wash the DNA pellet, and then centrifuged for 1 
minute at 2000 x g. The ethanol was decanted and each tube was drained on clean absorbent paper. The 
DNA was dried in the tube by inversion for 10 minutes, and then 1000 )j.l of IX TE was added. The size 
of each sample was estimated, and less TE buffer was added during the following DNA hydration step if 
the sample was smaller. The DNA was allowed to rehydrate overnight at room temperature, and DNA 
samples were stored at 2-8°C. 

[0199] DNA was quantified by placing samples on a hematology mixer for at least 1 hour. DNA 
was serially diluted (typically 1 :80, 1 : 160, 1 :320, and 1 :640 dilutions) so that it would be within the 
measurable range of standards. 125 pil of diluted DNA was transferred to a clear U-bottom microtitre 
plate, and 125 p-l of IX TE buffer was transferred into each well using a multichannel pipette. The DNA 
and IX TE were mixed by repeated pipetting at least 1 5 times, and then the plates were sealed. 50 jxl of 
diluted DNA was added to wells A5-H1 2 of a black flat bottom microtitre plate. Standards were inverted 
six times to mix them, and then 50 ^1 of IX TE buffer was pipetted into well Al, 1000 ng/ml of standard 
was pipetted into well A2, 500 ng/ml of standard was pipetted into well A3, and 250 ng/ml of standard 
was pipetted into well A4. PicoGreen (Molecular Probes, Eugene, Oregon) was thawed and freshly 
diluted 1 :200 according to the number of plates that were being measured. PicoGreen was vortexed and 
then 50\xl was pipetted into all wells of the black plate with the diluted DNA. DNA and PicoGreen were 
mixed by pipetting repeatedly at least 10 times with the multichannel pipette. The plate was placed into a 
Fluoroskan Ascent Machine (microplate fluorometer produced by Labsystems) and the samples were 
allowed to incubate for 3 minutes before the machine was run using filter pairs 485 nm excitation and 
538 nm emission wavelengths. Samples having measured DNA concentrations of greater than 450 ng/ju.1 
were re-measured for conformation. Samples having measured DNA concentrations of 20 ng/jal or less 
were re-measured for confirmation. 

Pooling Strategies - Discovery Cohort 

[0200] Samples were derived from the Nottingham knee OA family study (UK) where index cases 
were identified through a knee replacement registry. Siblings were approached and assessed with knee x- 
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rays and assigned status as affected or unaffected. In all 1 ,157 individuals were available. In order to 
create same-sex pools of appropriate sizes, 335 unrelated female individuals with OA fi-om the 
Nottingham OA sample were selected for the case pool. The control pool was made up of unrelated 
female individuals from the St. Thomas twin study (England) with normal knee x-rays and without other 
indications of OA, regardless of anatomical location, as well as lacking family history of OA. The St. 
Thomas twin study consists of Caucasian, female participants from the St. Thomas' Hospital, London, 
adult-twin registry, which is a voluntary registry of >4,000 twin pairs ranging from 1 8 to 76 years of age. 
The female case samples and female control samples are described further in Table 1 below. 

[0201] A select set of samples from each group were utilized to generate pools, and one pool was 
created for each group. Each individual sample in a pool was represented by an equal amount of 
genomic DNA. For example, where 25 ng of genomic DNA was utilized in each PGR reaction and there 
were 200 individuals in each pool, each individual would provide 1 25 pg of genomic DNA. Inclusion or 
exclusion of samples for a pool was based upon the following criteria: the sample was derived from an 
individual characterized as Caucasian; the sample was derived from an individual of British paternal and 
maternal descent; case samples were derived from individuals diagnosed with specific knee osteoarthritis 
(OA) and were recruited from an OA knee replacement clinic. Control samples were derived from 
individuals free of OA, family history of OA, and rheumatoid arthritis. Also, sufficient genomic DNA 
was extracted from each blood sample for all allelotyping and genotyping reactions performed during the 
study. Phenotype information from each individual was collected and included age of the individual, 
gender, family history of OA, general medical information (e.g., height, weight, thyroid disease, 
diabetes, psoriasis, hysterectomy), joint history (previous and current symptoms, joint-related operations, 
age at onset of symptoms, date of primary diagnosis, age of indiAndual as of primary diagnosis and order 
of involvement), and knee-related findings (crepitus, restricted passive movement, bony 
swelling/deformity). Additional knee information included knee history, cvirrent symptoms, any major 
knee injury, menisectomy, knee replacement surgery, age of surgery, and freatment history (including 
hormone replace therapy (HRT)). Samples that met these criteria were added to appropriate pools based 
on disease status. 

[0202] The selection process yielded the pools set forth in Table 1 , which were used in the studies 
that follow: 



TABLE 1 





Female case 


Female control 


Pool size 
(Nxmiber) 


335 


335 


Pool Criteria 

(ex: case/control) 


confrol 


case 
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Mean Age 
(ex: years) 



Example 2 

Association of Polymorphic Variants with Osteoarthritis 

[0203J A whole-genome screen was performed to identify particular SNPs associated with 
occurrence of osteoarthritis. As described in Example 1, two sets of samples were utilized, which 
included samples from female individuals having knee osteoarthritis (osteoarthritis cases), and samples 
from female individuals not having knee osteoarthritis (female controls). The initial screen of each pool 
was performed in an allelotyping study, in which certain samples in each group were pooled. By pooling 
DNA from each group, an allele frequency for each SNP in each group was calculated. These allele 
frequencies were then compared to one another. Particular SNPs were considered as being associated 
with osteoarthritis when allele frequency differences calculated between case and control pools were 
statistically significant. SNP disease association results obtained from the allelotyping study were then 
validated by genotyping each associated SNP across all samples from each pool. The results of the 
genotyping then were analyzed, allele frequencies for each group were calculated from the individual 
genotyping results, and a p-value was calculated to determine whether the case and control groups had 
statistically significant differences in allele frequencies for a particular SNP. When the genotyping 
results agreed with the original allelotyping results, the SNP disease association was considered validated 
at the genetic level. 

SNP Panel Used for Genetic Analyses 

[0204] A whole-genome SNP screen began with an initial screen of approximately 25,000 SNPs 
over each set of disease and control samples using a pooling approach. The pools studied in the screen 
are described in Example 1 . The SNPs analyzed in this study were part of a set of 25,488 SNPs 
confirmed as being statistically polymorphic as each is characterized as having a minor allele frequency 
of greater than 10%. The SNPs in the set reside in genes or in close proximity to genes, and many reside 
in gene exons. Specifically, SNPs in the set are located in exons, introns, and within 5,000 base-pairs 
upstream of a transcription start site of a gene. In addition, SNPs were selected according to the 
following criteria: they are located in ESTs; they are located in Locuslink or Ensembl genes; and they are 
located in Genomatix promoter predictions. SNPs in the set were also selected on the basis of even 
spacing across the genome, as depicted in Table 2. 

[0205] A case-control study design using a whole genome association strategy involving 
approximately 28,000 single nucleotide polymorphisms (SNPs) was employed. Approximately 25,000 
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SNPs were evenly spaced in gene-based regions of the human genome with a median inter-marker 
distance of about 40,000 base pairs. Additionally, approximately 3,000 SNPs causing amino acid 
substitutions in genes described in the literature as candidates for various diseases were used. The case- 
control study samples were of female Caucasian origin (British paternal and maternal descent) 670 
individuals were equally distributed in two groups: female controls and female cases. The whole genome 
association approach was first conducted on 2 DNA pools representing the 2 groups. Significant markers 
were confirmed by individual genotyping. 



TABLE 2 



General Statistics 


Spacing Statistics 


Total # of SNPs 


25,488 


Median 


37,058 bp 


#ofExonic SNPs 


>4,335 (17%) 


Minimum* 


1,000 bp 


# SNPs with refSNP ID 


20,776 (81%) 


Maximum* 


3,000,000 bp 


Gene Coverage 


> 10,000 


Mean 


122,412 bp 


Chromosome Coverage 


All 


Std Deviation 


373,325 bp 






^Excludes outliers 





Allelotvping and Genotyping Results 

[0206] The genetic studies summarized above and described in more detail below identified allelic 
variants in the target genes that are associated with osteoarthritis. 

Assay for Verifying. Allelotyping. and Genotyping SNPs 

[0207] A MassARRAY™ system (Sequenom, Inc.) was utilized to perform SNP genotyping in a 
high-throughput fashion. This genotyping platform was complemented by a homogeneous, single-tube 
assay method (hME™ or homogeneous MassEXTEND™ (Sequenom, Inc.)) in which two genotyping 
primers anneal to and amplify a genomic target surrounding a polymorphic site of interest. A third 
primer (the MassEXTEND™ primer), which is complementary to the amplified target up to but not 
including the polymorphism, was then enzymatically extended one or a few bases through the 
polymorphic site and then terminated. 

[0208] For each polymorphism, SpectroDESIGNER™ software (Sequenom, Inc.) was used to 
generate a set of PCR primers and a MassEXTEND^'^ primer which where used to genotype the 
polymorphism. Other primer design software could be used or one of ordinary skill in the art could 
manually design primers based on his or her knowledge of the relevant factors and considerations in 
designing such primers. Table 3 shows PCR primers and Table 4 shows extension primers used for 
analyzing polymorphisms. The initieil PCR amplification reaction was performed in a 5 ^i,l total volume 
containing IX PCR buffer with 1.5 mM MgCU (Qiagen), 200 yM each of dATP, dOTP, dCTP, dXTP 
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(Gibco-BRL), 2.5 ng of genomic DNA, 0.1 units of HotStar DNA polymerase (Qiagen), and 200 nM 
each of forward and reverse PCR primers specific for the polymorphic region of interest. 



TABLE 3: PCR Primers 



SNP 
Reference 


Forward PCR primer 


Reverse PCR primer 


rs756519 


ACGTTGGATGTCTAGAGACACCTGAGGTTG 


AGGTTGGATGTGTTTCACTTCAGAGCCCTG 


rsl 042327 


ACGTTGGATGAACTTCACATCACAGCTCCC 


ACGTTGGATGCAGAAGTTGGGTTTTCCAGC 


rs8770 


ACGTTGGATGCTGTGACTGGACACTTTTG 


ACGTTGGATGAAAATAGAGGTGCAGAGATG 


rs1 563055 


ACGTTGGATGAGTTCTTTGTCCTCACATTG 


AGGTTGGATGCCCTTTAGAAGCACATACTC 


rs912428 


ACGTTGGATGACTACATCCATTCCAGGGAG 


ACGTTGGATGTCAGATCAGAGTGAGTTTAG 


rs1 888475 


ACGTTGGATGACCCCTGGCAAGTGAATTAC 


AGGTTGGATGGGGAGGTGGATGTTCTTATC 



[0209] Samples were incubated at 95 °C for 15 minutes, followed by 45 cycles of 95°C for 20 
seconds, 56°C for 30 seconds, and 12°Q for 1 minute, finishing with a 3 minute final extension at 72°C. 
Following amplification, shrimp alkaline phosphatase (SAP) (0.3 imits in a 2 ^.1 volume) (Amersham 
Pharmacia) was added to each reaction (total reaction volume was 7 |iU) to remove any residual dNTPs 
that were not consimied in the PCR step. Samples were incubated for 20 minutes at 37°C, followed by 5 
minutes at 85°C to denature the SAP. 

[0210] Once the SAP reaction was complete, a primer extension reaction was initiated by adding a 
polymorphism-specific MassEXTEND™ primer cocktail to each sample. Each MassEXTEND™ 
cocktail included a specific combination of dideoxynucleotides (ddNTPs) and deoxynucleotides (dNTPs) 
used to distinguish polymorphic alleles fi-om one another. Methods for verifying, allelotjrping and 
genotyping SNPs are disclosed, for example, in U.S. Pat. No. 6,258,538, the content of which is hereby 
incorporated by reference, hi Table 4, ddNTPs are shown and the fourth nucleotide not shown is the 
dNTP. 



TABLE 4: Extension Primers 



SNP 
Reference 


Extend Probe 


Termination 
Mix 


rs756519 


CAGAGCCCTGTTCTTTGATTT 


ACG 


rsl 042327 


CATCACAGCTCCCCACCAT 


ACT 


rs8770 


TAGACACTGTGTAAGCAATC 


ACG 


rs1 563055 


TTCTCCTCACATTGTTTCTACT 


ACG 


rs912428 


CCATTCCAGGGAGACTCCCA 


ACT 


rs 1888475 


GAGATCAAATGATTCCCCTGT 


ACT 



[0211] The MassEXTEND™ reaction was performed in a total volume of 9 ^1, with the addition of 
IX ThermoSequenase buffer, 0.576 units of Thermo Sequenase (Amersham Pharmacia), 600 nM 
MassEXTEND™ primer, 2 mM of ddATP and/or ddCTP and/or ddGTP and/or ddTTP, and 2 mM of 
dATP or dCTP or dGTP or dTTP. The deoxy nucleotide (dNTP) used ui the assay normally was 
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complementary to the nucleotide at the polymorphic site in the amplicon. Samples were incubated at 
94°C for 2 minutes, followed by 55 cycles of 5 seconds at 94°C, 5 seconds at 52°C, and 5 seconds 
at 72°C. 

[0212] Following incubation, samples were desalted by adding 1 6 |al of water (total reaction volume 
was 25 )j,l), 3 mg of SpectroCLEAN™ sample cleaning beads (Sequenom, Inc.) and allowed to incubate 
for 3 minutes with rotation. Samples were then robotically dispensed using a piezoelectric dispensing 
device (SpectroJET™ (Sequenom, Inc.)) onto either 96-spot or 384-spot silicon chips containing a matrix 
that crystallized each sample (SpectroCHIP™ (Sequenom, Inc.)). Subsequently, MALDI-TOF mass 
spectrometry (Biflex and Autoflex MALDI-TOF mass spectrometers (Bruker Daltonics) can be used) and 
SpectroTYPER RT™ software (Sequenom, Inc.) were used to analyze and interpret the SNP genotype for 
each sample. 

Genetic Analysis 

[0213] Minor allelic frequencies for the polymorphisms set forth in Table A were verified as being 
10% or greater using the extension assay described above in a group of samples isolated from 92 
individuals originating from the state of Utah in the United States, Venezuela and France (Coriell cell 

repositories). 

[0214] Genotyping results are shown for female pools in Table 5. In Table 5, "AF" refers to allelic 
frequency; and "F case" and "F control" refer to female case and female control groups, respectively. 



TABLE 5: Genotyping Results 



SNP Reference 


AT 
F case 


AF 
F control 


p-value 


rs756519 


C = 0.581 
T = 0.419 


C = 0.656 
T = 0.344 


0.0055 


rs 1042327 


T = 0.472 
C = 0.528 


T = 0.563 
C = 0.437 


0.0012 


rs8770 


C = 0.529 
T = 0.471 


C = 0.432 
T = 0.568 


0.0001 


rs 1563055 


C = 0.653 
T = 0.347 


C = 0.736 
T = 0.264 


0.0013 


rs912428 


T = 0.228 
C = 0.772 


T = 0.170 
C = 0.830 


0.0076 


rsl 888475 


A = 0.188 
G = 0.812 


A = 0.135 
G = 0.865 


0.0087 



[0215] All of the single marker alleles set forth in Table A were considered validated, since the 
genotyping data agreed with the allelotyping data and each SNP significantly associated with 
osteoarthritis. Particularly significant associations with osteoarthritis are indicated by a calculated p- 
value of less than 0.05 for genotype results. 
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Example 3 

Association of Polymorphic Variants with Osteoarthritis in Replication Cohorts 

[0216] The single marker polymorphisms set forth in Table A were genotyped again in two 
replication cohorts consisting of individuals selected for OA. 

Sample Selection and Pooling Strategies - Replication Sample 1 

[0217] A second case control sample (replication sample #1) was created by using 100 Caucasian 
female cases from Chingford, UK, and 148 unrelated female cases from the St. Thomas twin study. 
Cases were defined as having Kellgren-Lawrence (KL) scores of at least 2 in at least one knee x-ray. In 
addition, 199 male knee replacement cases from Nottingham were included. (For a cohort description, 
see the Nottingham description provided in Example 1). The control pool was made up of unrelated 
female individuals from the St. Thomas twin study (England) with normal knee x-rays and without other 
indications of OA, regardless of anatomical location, as well as lacking family history of OA. The St. 
Thomas twin study consists of Caucasian, female participants from the St. Thomas' Hospital, London, 
adult-twin registry, which is a voluntary registry of >4,000 twin pairs ranging from 18 to 76 years of age. 
The replication sample 1 cohort was used to replicate the initial results. Table 6 below summarizes the 
selected phenotype data collected from the case and control individuals. 



TABLE 6 



Phenotype 


Female cases (n=248): 

median (range)/ (n,%) 


Male cases (n=199): 

median (range)/ (n,%) 


Female controls (n=313): 

mean (range)/ (n,%) 


Age 


59 (39- 73) 


66 (45- 73) 


55 (50- 72) 


Height (cm) 


162(141-178) 


175(152- 198) 


162(141-176) 


Weight (i^g) 


68 (51- 123) 


86(62-127) 


64(40-111) 


Body mass index 


26(18- 44) 


29 (21-41) 


24(18-46) 


Kellgren- 


0 (63. 26%). 1 (20. 8%), 2 






Lawrence* left 


(105. 43%). 3 (58. 23%). 4 


NA 


NA 


l^nee 


(1.0%) 






Kellgren- 


0(43,7%), 1 (18, 7%), 2 






Lawrence* right 


(127, 52%), 3(57, 23%), 4 


NA 


NA 


knee 


(1,0%) 






KL* >2 both 


No (145. 59%), Yes (101, 


NA 


NA 


knees 


41%) 


KL* >2 either 
knee 


No (0, 0%), Yes (248, 100%) 


NA 


NA 



* 0: normal, 1 : doubtful, 2: definite osteophyte (bony protuberance), 3: joint space 
narrowing (with or without osteophyte), 4: joint deformity 
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Sample Selection and Pooling Strategies - Replication Sample 2 

[0218] A third case control sample (replication sample #2) was created by using individuals with 
symptoms of OA from Newfoundland, Canada. These individuals were recruited and examined by 
rheumatologists. Affected joints were x-rayed and a fmal diagnosis of definite or probable OA was made 
according to American College of Rheumatology criteria by a single rheumatologist to avoid any inter- 
examiner diagnosis variability. Controls were recruited from volunteers without any symptoms from the 
musculoskeletal system based on a normal joint exam performed by a rheumatologist. Only cases with a 
diagnosis of definite OA were included in the study. Only individuals of Caucasian origin were included. 
The cases consisted of 228 individuals with definite knee OA, 106 individuals with definite hip OA, and 
74 individuals with hip OA. 



TABLE? 



Phenotype 


Case 


Control 


Age at Visit 


62.7 


52.5 


Sex (Female/Male) 


227/119 


174/101 


Knee OA Xray: No 

Unknown 
Yes 


35% (120) 


80% (16) 


1%(4) 


0% (0) 


64% (221) 


20% (4) 


Hip OA Xray: No 

Unl<nown 

Yes 


63% (215) 


80% (16) 


2% (7) 


0% (0) 


35% (121) 


20% (4) 



Assav for Verifying. Allelotvping. and Gcnotyping SNPs 

[0219] Genotyping of the replication cohorts described in Tables 6 and 7 was performed using the 
same methods used for the original genotyping, as described herein. A MassARRAY™ system 
(Sequenom, Inc.) was utilized to perform SNP genotyping in a high-throughput fashion. This genotyping 
platform was complemented by a homogeneous, single-tube assay method (hME™ or homogeneous 
MassEXTEND™ (Sequenom, Inc.)) in which two genotyping primers anneal to and amplify a genomic 
target surrounding a polymorphic site of interest. A third primer (the MassEXTEND™ primer), which is 
complementary to the amplified target up to but not including the polymorphism, was then enzymatically 
extended one or a few bases through the polymorphic site and then terminated. 

[0220] For each polymorphism, SpectroDESIGNER™ software (Sequenom, Inc.) was used to 
generate a set of PCR primers and a MassEXTEND™ primer which where used to genotype the 
polymorphism. Other primer design software could be used or one of ordinary skill in the art could 
manually design primers based on his or her knowledge of the relevant factors and considerations in 
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designing such primers. Table 3 shows PGR primers and Table 4 shows extension probes used for 
analyzing (_e.g., genotyping) polymorphisms in the replication cohorts. The initial PGR amplification 
reaction was performed inaS [i\ total volume containing IX PGR buffer with 1 .5 mM MgCla (Qiagen), 
200 fxM each of dATP, dGTP, dCTP, dTTP (Gibco-BRL), 2.5 ng of genomic DNA, 0.1 units of HotStar 
DNA polymerase (Qiagen), and 200 nM each of forward and reverse PGR primers specific for the 
polymorphic region of interest. 

[0221] Samples were incubated at 95°C for 15 minutes, followed by 45 cycles of 95''C for 20 
seconds, 56°G for 30 seconds, and 72°C for 1 minute, finishing with a 3 minute final extension at 72°C. 
Following amplification, shrimp alkaline phosphatase (SAP) (0.3 units in a 2 jjl volume) (Amersham 
Pharmacia) was added to each reaction (total reaction volume was 7 yA) to remove any residual dNTPs 
that were not consumed in the PGR step. Samples were incubated for 20 minutes at 37''C, followed by 5 
minutes at 85°C to denature the SAP. 

[0222] Once the SAP reaction was complete, a primer extension reaction was initiated by adding a 
polymorphism-specific MassEXTEND™ primer cocktail to each sample. Each MassEXTEND™ 
cocktail included a specific combination of dideoxynucleotides (ddNTPs) and deoxynucleotides (dNTPs) 
used to distinguish polymorphic alleles fi-om one another. Methods for verifying, allelotyping and 
genotyping SNPs are disclosed, for example, in U.S. Pat. No. 6,258,538, the content of which is hereby 
incorporated by reference. In Table 7, ddNTPs are shown and the fourth nucleotide not shown is the 
dNTP. 

[0223] The MassEXTEND™ reaction was performed in a total volxmie of 9 (xl, with the addition of 
IX ThermoSequenase buffer, 0.576 mats of ThermoSequenasc (Amersham Pharmacia), 600 nM 
MassEXTEND™ primer, 2 mM of ddATP and/or ddCTP and/or ddGTP and/or ddTTP, and 2 mM of 
dATP or dCTP or dGTP or dTTP. The deoxy nucleotide (dNTP) used in the assay normally was 
complementary to the nucleotide at the polymorphic site in the amplicon. Samples were incubated at 
94°C for 2 minutes, followed by 55 cycles of 5 seconds at 94°C, 5 seconds at 52*'C, and 5 seconds 
at 72°C. 

[0224] Following incubation, samples were desalted by adding 1 6 ^1 of water (total reaction volume 
was 25 nl), 3 mg of SpectroCLEAN™ sample cleaning beads (Sequenom, Inc.) and allowed to incubate 
for 3 minutes with rotation. Samples were then robotically dispensed using a piezoelectric dispensing 
device (SpectroJET™ (Sequenom, Inc.)) onto either 96-spot or 384-spot silicon chips containing a matrix 
that crystallized each sample (SpectroCHIP™ (Sequenom, Inc.)). Subsequently, MALDI-TOF mass 
spectrometry (Biflex and Autoflex MALDI-TOF mass spectrometers (Bruker Daltonics) can be used) and 
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SpectroTYPER RT™ software (Sequenom, Inc.) were used to analyze and interpret the SNP genotype for 
each sample. 

Genetic Analysis 

[0225] Genotyping results for replication cohorts #1 and #2 are provided in Tables 8 and 9, 
respectively. 

TABLES 



rsID 


Replication #1 
(Mixed IMale/Female cases and Female controls) 

AFOACon AFOACas Delta P-value 


Meta-analysis 
Disc. * Rep #1 
P-value 


rs756519 


0.4 


0.43 


-0.04 


0.140 


0.0098 


rs1 042327 


0.49 


0.52 


-0.03 


0.234 


0.0430 


rs8770 


0.51 


0.48 


0.03 


0.303 


0.0480 


rs1 563055 


0.31 


0.35 


-0.04 


0.083 


0.0002 


rs912428 


0.86 


0.8 


0.06 


0.004 


-0.00001 


rs1 888475 


0.86 


0.81 


0.04 


0.032 


0.0002 



TABLE 9 



rsID 


Replication #2 (Newfoundland) 
(Male/Female cases and controls) 

AFOACon AFOACas Delta P-value 


Meta-analysis 
Disc. ••■ Rep #2 
Not Done 


rs756519 


0.39 


0.40 


-0.007 


0.816 




rs1 042327 


0.49 


0.51 


-0.024 


0.405 




rs8770 


0.53 


0.49 


0.039 


0.195 




rsl 563055 


0.34 


0.34 


-0.005 


0.864 




rs912428 


0.82 


0.76 


0.058 


0.016 




rsl 888475 


0.80 


0.82 


-0.025 


0.280 





[0226] To combine the evidence for association from multiple sample collections, a meta-analysis 
procediire was employed. The allele frequencies were compared between cases and controls within the 
discovery sample, as well as within the replication cohort #1 using the DerSimian-Laird approach 
(DerSimonian, R. and N. Laird. 1986. Meta-analysis in clinical trials. Control Clin Trials 7: 177-188.) 

[0227] The absence of a statistically significant association in one or more of the rephcation cohorts 
should not be interpreted as minimizing the value of the original fmding. There are many reasons why a 
biologically derived association identified in a sample from one population would not replicate in a 
sample from another population. The most important reason is differences in population history. Due to 
bottlenecks and founder effects, there may be common disease predisposing alleles present in one 
population that are relatively rare in another, leading to a lack of association in the candidate region. 
Also, because common diseases such as arthritis-related disorders are the result of susceptibilities in 
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many genes and many environmental risk factors, differences in population-specific genetic and 
environmental backgrounds could mask the effects of a biologically relevant allele. For these and other 
reasons, statistically strong results in the original, discovery sample that did not replicate in one or more 
of the replication samples may be further evaluated in additional replication cohorts and experimented 
systems. 

Example 4 
Chromosome 6 Region Proximal SNPs 

[0228] It has been discovered that SNPs rs756519, rs 1042327 and rs8770 on chromosome 6 (6q27) 
are associated with occurrence of osteoarthritis in subjects. This region contains genes that encode 
proteasome (prosome, macropain) subunit, beta type, 1 (PSMBl), TATA box binding protein (TBP), and 
programmed cell death 2 (PDCD2). 

[0229] One himdred-nine additional allelic variants proximal to rs756519, rsl042327 and rs8770 
were identified and subsequently allelotOyped in osteoarthritis case and control sample sets as described 
in Exemiples 1 and 2. The polymorphic variants are set forth in Table 10. The chromosome positions 
provided in column four of Table 10 are based on Genome "Build 34" of NCBI's GenBank. 



TABLE 10 



dbSNP 
rs# 


Chromosome 


Position in SEQ 
ID NO: 1 


Cliromosome 
Position 


Allele 
Variants 


rs1 474555 


6 


229 


170689279 


C/t 


rs1 474554 


6 


6310 


170695360 


a/q 


re10334 


6 


11840 


170700890 


aft 


rs10541 


6 


11870 


170700920 


a/t 


rs3823299 


6 


12064 


170701114 


a/q 


RS742348 


6 


13392 


170702442 


c/q 


rs1 474644 


6 


16354 


170705404 




rs1 474643 


6 


16559 


170705609 


c/t 


rs2056970 


6 


16935 


170705985 


a/q 


rs2223474 


6 


17616 


170706666 


c/t 


rs2206284 


6 


17737 


170706787 


c/t 


RS756519 


6 


18321 


170707371 


c/t 


RS756518 


6 


18453 


1 70707503 


a/q 


RS756517 


6 


18811 


1 70707861 


c/t 


rs1 474642 


6 


20020 


1 70709070 


c/t 


rs2038093 


6 


21662 


170710712 


c/q 


rs2038092 


6 


23197 


170712247 


c/q 


rs2223473 


6 


23446 


170712496 


q/t 


RS760909 


6 


24339 


170713389 


fl/t 


rs2076319 


6 


25504 


170714554 


a/q 


rs3778589 


6 


27174 


170716224 


a/q 


rs3800236 


6 


28008 


170717058 


a/t 
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dbSNP 


Chromosome 


Position in SEQ 


Cliromosome 


Allele 


rs# 




ID NO: 1 


Position 


Variants 


rs2206286 


6 


29294 


170718344 


c/t 


rs12717 


6 


29759 


170718809 


da 


rs21 79373 


6 


30832 


170719882 


a/g 


rs3800235 


6 


44512 


170733562 


a/c 


rs38232g8 


6 


44850 


170733900 


da 


rs2076318 


6 


45884 


170734934 


a/g 


rs2235506 


6 


46345 


170735395 


c/t 


rs2072916 


6 


48589 


170737639 


a/q 


rs3734763 


6 


53371 


170742421 


a/q 


rs3 177571 


6 


53911 


170742961 


q/t 


rs8770 


6 


53990 


170743040 


a/q 


rs31 73219 


6 


55152 


170744202 


da 


RS960744 


6 


55667 


170744717 


c/t 


rs2066954 


6 


58952 


170748002 


a/c 


rs2072917 


6 


59315 


170748365 


q/t 


rs3 173220 


6 


60029 


170749079 


a/q 


RS734249 


6 


61477 


170750527 


a/c 


rs2092310 


6 


62988 


170752038 


c/t 


rs2092309 


6 


63090 


170752140 


c/q 


rs1016536 


6 


64021 


170753071 


a/c 


rs2235506 


6 


65685 


170754735 


c/t 


rs2076998 


6 


70220 


170759270 


a/q 


rs2076997 


6 


70323 


170759373 


a/c 


rs2345478 


6 


70959 


170760009 


a/c 


rs2021899 


6 


73436 


170762486 


c/q 


rs2021898 


6 


82945 


170771995 


a/q 


rs2345682 


6 


82958 


170772008 


q/t 


rs2345683 


6 


82961 . 


170772011 


c/q 


rs2881195 


6 


82964 


170772014 


c/t 


rs2345684 


6 


82965 


170772015 


Q/t 


rs3046261 


6 


83006 


170772056 


-/cttt 


rs4083413 


6 


83025 


170772075 


c/t 


rs4083412 


6 


83034 


170772084 


a/g 


rs2345685 


6 


83074 


170772124 


g/t 


rs2021897 


6 


83132 


170772182 


Q/t 


rs4036211 


6 


83155 


170772205 


c/t 


rs4036212 


6 


83172 


170772222 


a/t 


rs4036213 


6 


83174 


170772224 


q/t 


rs2345686 


6 


83206 


170772256 


c/t 


rs4036214 


6 


83216 


170772266 


q/t 


rs4036215 


6 


83234 


170772284 


q/t 


rs2345687 


6 


83252 


170772302 


a/q 


rs2345688 


6 


83260 


170772310 


a/c 


rs2881196 


6 


83263 


170772313 


a/c 


rs3046288 


6 


83296 


170772346 


-/at 


rs4036216 


6 


83319 


170772369 


a/Q 


rs4036205 


6 


83322 


170772372 


C/Q 
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dbSNP 
rs# 


Chromosome 
romosome 


Position in SEQ 
ID NO: 1 


Chromosome 
Position 


Ailele 
Variants 


rs2092307 


6 


83324 


170772374 


a/c 


rs4036206 


6 


83357 


170772407 


C/Q 


rs2345689 


6 


83375 


170772425 


c/t 


rs2345690 


6 


83381 


170772431 


c/t 


rs2345691 


6 


83389 


170772439 


a/t 


rs2345692 


6 


83443 


170772493 


a/a 


rs3046306 


6 


83499 


170772549 


-/ggtg 


rs4G36207 


6 


83545 


170772595 


c/t 


rs2345693 


6 


83566 


170772616 


c/t 


rs2345694 


6 


83591 


170772641 


c/t 


rs2345695 


6 


83619 


170772669 


Q/t 


rs2345696 


6 


83698 


170772748 


a/g 


rs4036209 


6 


83780 


170772830 


g/t 


rs2345697 


6 


83784 


170772834 


q/t 


rs2881197 


6 


83826 


170772876 


q/t 


rs2345698 


6 


83832 


170772882 


c/t 


rs2345699 


6 


83852 


170772902 


c/t 


rs2744640 


6 


86297 


170775347 


c/t 


rs2744639 


6 


86315 


170775365 


q/t 


rs2744638 


6 


86420 


170775470 


c/q 


rs2744637 


6 


86460 


170775510 


c/q 


rs2744636 


6 


86714 


170775764 


c/t 


rs2744635 


6 


86718 


170775768 


c/t 


rs2744634 


6 


86736 


170775786 


c/q 


rs2744633 


6 


86753 


170775803 


c/t 


rs2744632 


6 


86766 


170775816 


q/t 


rs2744630 


6 


88162 


170777212 


c/q 


rs2744629 


6 


88218 


170777268 


a/g 


rs2744628 


6 


88246 


170777296 


a/g 


rs2744627 


6 


88255 


170777305 


c/t 


rs2977616 


6 


88309 


170777359 


g/t 


rs2977617 


6 


88310 


170777360 


a/t 


rs2744626 


6 


88471 


170777521 


a/g 


rs2744625 


6 


88619 


170777669 


c/t 


rs31 15847 


6 


88904 


170777954 


c/t 


rs2744623 


6 


89044 


170778094 


c/g 


rs4036193 


6 


90531 


170779581 


-/aaaaa 


rs4036194 


6 


90534 


170779584 


a/q 


rs4036196 


6 


90613 


170779663 


c/g 


RS1042327 


6 


46252 


170735302 


c/t 
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Assay for Verifying and Allelotyping SNPs 

[0230] The methods used to verify and allelotype the 109 proximal SNPs of Table 10 are the same 
methods described in Examples 1 and 2 herein. The primers and probes used in these assays are provided 
in Table 1 1 and Table 12, respectively. 



TABLE 11 



dbSNP 


Forward 

PCR primer 


PCR^rimer 

primer 


Rs 1474555 


ACGTTGGATGACATCAACTGAAGCCGACAG 


ACGTTGGATGAATGGTGGAATGTGATGAGA 


Rs 1474554 


ACGTTGGATGATACACCTAGGACACCTCCA 


ACGTTGGATGCAGAAGGAGATAAACCCAGC 


rs10334 


ACGTTGGATGAACAGTTTCCTCCCTGATGC 


ACGTTGGATGCGGCTGGTGAAAGATGTCTT 


rs10541 


ACGTTGGATGACTATGCAGATCCGGAGTGC 


ACGTTGGATGGTCCTTGGACAGAGCCATG 


Rs3823299 


ACGTTGGATGCTCATGTGTACGAGGATTTG 


ACGTTGGATGGTCTGGAAGGGTCTTTATTC 


rs742348 


ACGTTGGATGTGTGGA 1 1 1 1 CCAGTGCTCG 


ACGTTGGATGCTGTACTTGAACTCCCAAGC 


Rs 1474644 


ACGTTGGATGGCAAGACAAGCATAATTGGG 


ACGTTGGATGTAAAGGGCATTTTGGCTTCC 


Rs 1474643 


ACGTTGGATGTCTCCCAAATTAAAAGTGGC 


ACGTTGGATGGATACCAAAGTCCTACTTAC 


Rs2056970 


ACGTTGGATGTGGGACTACAGGAAGAGAAG 


ACGTTGGATGCAAAACACAGACCTTCAGCC 


Rs2223474 


ACGTTGGATGCCAGGGTAAAGAAAAGATCC 


ACGTTGGATGAGAGGCTTACCTCCTAAAAG 


Rs22G6284 


ACGTTGGATGTCACATACTAGGTGGATCCC 


ACGTTGGATGAAAGAGGAGAACACAGGATG 


rs756519 


ACGTTGGATGTCTAGAGACACCTGAGGTTG 


ACGTTGGATGTGTrTCACTTCAGAGCCCTG 


rs756518 


ACGTTGGATGCCCAGATTAGACTCTCTAAC 


ACGTTGG ATG AAATAGCTG AGCTGCC ATTG 


rs756517 


ACGTTGGATGCTCGGTTGTTGACTCCTATC 


ACGTTGGATGGCGGATGTTAAGAGTCAGAG 


Rs 1474642 


ACGTTGGATGGGAGGTCATACATTAGCTTC 


ACGTTGGATGTACCATCTGACAGAATTCTG 


Rs2038093 


ACGTTGGATGGAGACAGAGTTTCACTCTTG 


ACGTTGGATGTAATCACTrGAACCCAGGAG 


Rs2038092 


ACGTTGGATGTTACCTGAGGTCAGGAGTTT 


ACGTTGGATGCCACACCCAGCTGA i i i i i G 


Rs2223473 


ACGTTGGATGCCTTTATGTTATTGCTTTCC 


ACGTTGGATGCAGGGAAATTTAAGAATAGC 


rs760909 


ACGTTGGATGGGAAGAGGCAAGCTTAGTTC 


ACGTTGGATGGCAGCATTAACGAATGCCTG 


Rs2076319 


ACGTTGGATGGACATTTCACAATGCCTTTG 


ACGTTGGATGCCAACAGCAACTTAAAAACTC 


Rs3778589 


ACGTTGGATGGCAAGAGAGAGAAAAGTTCC 


ACGTTGGATGGTGTTTCTGTGCCATTTCAC 


Rs3800236 


ACGTTGGATGAGAGAATGAGGCCTCA 1 I I I 


ACGTTGGATGCTCAGTCATTGTTC 1 1 1 1 1 C 


Rs2206286 


ACGTTGGATGTTCAGAGGCTAACGGTGTAC 


AGGTTGGATGAACATAGCCTCTGCTCTGTG 


rs12717 


ACGTTGGATGAAAATCGCAGCTGCAAAGGG 


ACGTTGGATGAGACAGCAAGTGTCGGATCC 


Rs2 179373 


ACGTTGGATGGAAGTGACCTATGCTCAGAC 


ACGTTGGATGAATGTCACTTCCGCCAGTTC 


RS3800235 


ACGTTGGATGCTATGTGTTGATACCTCCAAG 


ACGTTGGATGGCTTCATAAATGAACTGAAC 


RS3823298 


ACGTTGGATGGGTGGTTTCTTGTCTTGATG 


ACGTTGGATG 1 1 1 1 1 GTCCCAGAGCATCTG 


RS2076318 


ACGTTGGATGTCGGCCAAATTATTGTAGCC 


ACGTTGGATGCTCAGTAGAAATGCATGGGC 


RS2235506 


ACGTTGGATGTAACCATGTCAACTGTTCTC 


ACGTTGGATGCCCACCAACAATTTAGTAGG 


RS2072916 


ACGTTGGATGACGCTGGAGTCACTAAGATG 


ACGTTGGATGCAGATTAAGGCACAGGCATG 


RS3734763 


ACGTrGGATGGCGTTTTGCCTTrCAGTGTC 


ACGTTGGATGTAAAGAGGGTGGACCTTCAG 


Rs31 77571 


ACGTTGGATGGTCTGTTGTCAATATAGGTG 


ACGTTGGATGACAAAAGTGTCCAGTGACAG 


rs8770 


ACGTTGGATGAATTCCCTGTCACTGGACAC 


ACGTTGGATGCCAAAAATAGAGGTGCAGAG 


Rs3173219 


ACGTTGGATGACATAACCACACTGGAGGTG 


ACGTTGGATGCCTAGTTTTCAGACACGGTC 


rs960744 


ACGTTGGATGAAAGGGATGTCACAGTTCCC 


ACGTTGGATGGCCCTCTGAGTCAGATAAAC 


RS2066954 


ACGTTGGATGGAGGTTCTGGGTATAACTTTC 


ACGTTGGATGCTACAAACCAGTAAGCTGATG 


Rs2072917 


ACGTTGGATGTGCTAGGCACTCACACTATC 


ACGTTGGATGAGGCTTGGTAAGTTCCTCTG 


Rs31 73220 


ACGTTGGATGTATCTGGGTTGACAAAGGCG 


ACGTTGGATGACATAAGCAGGCTTGTGCAC 


rs734249 


ACGTTGGATGAGGTGGACACCAGCAGGGAA 


ACGTTGGATGTCACCTCTGCACATGTCTTG 
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dbSNP 

rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


RS2092310 


ACGTTGGATGTTAGTCAGGTAAAGCGGGAC 


ACGTTGGATGTCAGTGGAAGGCTGATCAAG 


RS2092309 


ACGTTGGATGATCTAATTGCTTCCCCTCCC 


ACGTTGGATGCAGCCTTCGACTGAATACAC 


Rs1016536 


ACGTTGGATGCCCCAAAAATTGGAGACAGG 


ACGTTGGATGGGCTGTCATAATCGTGTGTC 


Rs2235506 


ACGTTGGATGAAGTGATTCTCCTGCCTCAG 


ACGTTGGATGTGGTGAAACCCTGTCTCTAC 


RS2076998 


ACGTTGGATGGCTCTGTGATTTCGATGATG 


ACGTTGGATGAGCTACTTCTTGCAGGAGTC 


RS2076997 


ACGTTGGATGCAGAGCTTCCAAGTGTTTTC 


ACGTTGGATGAAAGGAGTGCTTAAAGGAGC 


RS2345478 


ACGTTGGATGCCTTCAACAAGTGCTGACAC 


ACGTTGGATGATCCAGGCATTATTGCCAGC 


RS2021899 


ACGTTGGATGGTTTTGTGGTGGATGATGGG 


ACGTTGGATGAGAGTGCCCATAATGGACAG 


Rs2021898 


ACGTTGGATGCGCAAGAAACTCCTTGGATG 


ACGTTGGATGCCAATTAAAGCCAAGGTCAC 


Rs2345682 


ACGTTGGATGATTCGCAAGAAACTCCTTGG 


ACGTTGGATGGGAAGAAATCTTACCAGAAC 


RS2345683 


ACGTTGGATGATTCGCAAGAAACTCCTTGG 


ACGTTGGATGGGAAGAAATCTTACCAGAAC 


RS2881195 


ACGTTGGATGATTCGCAAGAAACTCCTTGG 


ACGTTGGATGGGAAGAAATCTTACCAGAAC 


Rs2345684 


ACGTTGGATGATTCGCAAGAAACTCCTTGG 


ACGTTGGATGGGAAGAAATCTTACCAGAAC 


RS3046261 


ACGTTGGATGCTCCACTCAGACATCAAAAG 


ACGTTGGATGGTGACCTTGGCTTTAATTGG 


RS4083413 


ACGTTGGATGGTGACCTTGGCTTTAATTGG 


ACGTTGGATGCTCCACTCAGACATCAAAAG 


RS4083412 


ACGTTGGATGGTGACCTTGGCTTTAATTGG 


ACGTTGGATGCTCCACTCAGACATCAAAAG 


Rs2345685 


ACGTTGGATGGTTCTGGTAAGATTTCTTCC 


ACGTTGGATGAGTCTTACAATAGATGACTG 


RS2021897 


ACGTTGGATGGCAATTATTTACAGAAGCCC 


ACGTTGGATGTCCCACACAGTCATCTATTG 


RS4036211 


ACGTTGGATGCCCATTACAAGTTGGGCAGTT 


ACG 1 1 GGA 1 GC II 1 C 1 GA 1 1 CC 1 II II 1 1 1 CC 


RS4036212 


ACG 1 1 GGA 1 GC M I CI GA 1 1 CC 1 1 1 1 1 1 1 1 CO 


ACGTTGGATGCCCATTACAAGTTGGGCAGTT 


RS4036213 


ACGTTGGATGCCCATTACAAGTTGGGCAGTT 


ACGTTGGATGCTTTCTGATTCC 1 1 1 1 1 1 II CC 


RS2345686 


ACGTTGGATGCCCATTACAAGTTGGGCAGTT 


ACG 1 1 GGA 1 GC 1 1 1 CI GA 11 CCTTI TTTTTCC 


RS4036214 


ACGTTGGATGCCCATTACAAGTTGGGCAGTT 


ACGI IGGAIGCriiCIGAI ICCI 1 1 II 1 1 ICC 


RS4036215 


ACGI IGGAIGCI 1 ICIGAI ICCI 1 1 1 1 1 1 ICC 


ACGTTGGATGCCCATTACAAGTTGGGCAGTT 


RS2345687 


ACGTTGGATGGGATTGTAAGGTGAGACTTG 


ACGTTGGATGTTCCTCCCCATTACAAGTTG 


RS2345688 


ACGTTGGATGAGGGTCCCATCTAAGAATTC 


ACGTTGGATGGGATTGTAAGGTGAGACTTG 


RS2881196 


ACGTTGGATGAGGGTCCCATCTAAGAATTC 


ACGTTGGATGGGATTGTAAGGTGAGACTTG 


RS3046288 


ACGTTGGATGCCAACTTGTAATGGGGAGGA 


ACGTTGGATGCAGTTTTTACAGAGGGTCCC 


RS4036216 


ACGTTGGATGCTTGTAATGGGGAGGAAAAAA 


ACGTTGGATGTTCTCATTTTAATCTGTCAG 


Rs4036205 


ACGTTGGATGCTTGTAAtGGGGAGGAAAAAA 


ACGTTGGATGTTCTCATTTTAATCTGTCAG 


RS2092307 


ACGTTGGATGCTTGTAATGGGGAGGAAAAAA 


ACGTTGGATGTTCTCATTTTAATCTGTCAG 


Rs4036206 


ACGTTGGATGGACCCTCTGTAAAAACTGAC 


ACGTTGGATGCCACTGCACCTCAAATCTTC 


Rs2345689 


ACGTTGGATGTTCCCTGAGTATCTCCCATG 


ACGTTGGATGGGGACCCTCTGTAAAAACTG 


RS2345690 


ACGTTGGATGTTCCCTGAGTATCTCCCATG 


ACGTTGGATGGGGACCCTCTGTAAAAACTG 


RS2345691 


ACGTTGGATGGCCACCTGTTGGAGATTTAC 


ACGTTGGATGGGGACCCTCTGTAAAAACTG 


Rs2345692 


ACGTTGGATGTACATGGGAGATACTCAGGG 


ACGTTGGATGCCACTGCACCTCAAATCTTC 


RS3046306 


ACGTTGGATGGTATAACAAACCTTACCCTTG 


ACGTTGGATGTAAAGAAAGAAGATTTGAGG 


RS4036207 


ACGTTGGATGTATCAATGGAGAATGCGTGG 


ACGTTGGATGGGGAGTTAACCAGCAAAAGC 


Rs2345693 


ACGTTGGATGTCGACAACAAGAAGAGAAGG 


ACGTTGGATGCACATTAGACAAGGGTAAGG 


Rs2345694 


ACGTTGGATGCTACCTCTCTCGACAACAAG 


ACGTTGGATGCTTAAGTCCACGCATTCTCC 


RS2345695 


ACGTTGGATGCGCATTCTCCATTGATAAGAC 


ACGTTGGATGCCATTTAAAAGCTACCTCTC 


Rs2345696 


ACGTTGGATGCCTTACACAAGTGTAACTTC 


ACGTTGGATGCCCCAAAATATAATGGTAGG 


RS4036209 


ACGTTGGATGGGAACACAGTGTATAAGACC 


ACGTTGGATGGTTTTCACAACTTCGTTAGC 


RS2345697 


ACGTTGGATGGTTTTCACAACTTCGTTAGC 


ACGTTGGATGGCCACCCCAAAATATAATGG 


Rs2881197 


ACGTTGGATGGCTGGAGGAAAAACAAGAAC 


ACGTTGGATGCCTACCATTATATTTTGGGG 


Rs2345698 


ACGTTGGATGCTGGAGGAAAAACAAGAACTC 


ACGTTGGATGCATTATATTTTGGGGTGGCAT 


Rs234569g 


ACGTTGGATGGCTGGAGGAAAAACAAGAAC 


ACGTTGGATGGGGTGGCATATTTTGGTCTT 
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dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


RS2744640 


ACGTTGGATGGCAACAGCACTTAGTATGCC 


ACGTTGGATGTGTGAAGCTGCAAATCTGGC 


RS2744639 


ACGTTGGATGGCAACAGCACTTAGTATGCC 


ACGTTGGATGTGTGAAGCTGCAAATCTGGC 


RS2744638 


ACGTTGGATGAACCGTGGCAATACCACGTC 


ACGTTGGATGTGGGTTTGGGCTGGATTTGG 


RS2744637 


ACGTTGGATGTGAGTTGACAGCCTCTGCTGG 


ACGTTGGATGCACGTCAGTAAGGCAGAGAC 


RS2744636 


ACGTTGGATGTCGGAGATGACATTGTCACC 


ACGTTGGATGTTCCAGGGGTTACGTGTGTG 


Rs2744635 


ACGTTGGATGTGAGTCTGACTGTGTCACGG 


ACGTTGGATGTCGGAGATGACATTGTCACC 


Rs2744634 


ACGTTGGATGCGTGTTCCAGGGATTATATG 


ACGTTGGATGGCACATAACGCTTGGAACTC 


RS2744633 


ACGTTGGATGTATGAGTGTGACGGGTGTAG 


ACGTTGGATGGCACATAACGCTTGGAACTC 


Rs2744632 


ACGTTGGATGTAGCTGCCTTCCACATCCAA 


ACGTTGGATGTGTGACGGGTGTAGCGTTAG 


RS2744630 


ACGTTGGATGGGGTTCAAATGCCTCTGATAG 


ACGTTGGATGGGTCTAGGACAAGACCCATT 


RS2744629 


ACGTTGGATGAACTTTCCCTTAGCCAGTGG 


ACGTTGGATGATCAGAGGCATTTGAACCCC 


Rs2744628 


ACGTTGGATGTTGACCTCAAATCATGTCAC 


ACGTTGGATGTATCAGAGGCATTTGAACCC 


RS2744627 


ACGTTGGATGG GGTGGTTTATGTTCCACTG 


ACGTTGGATGCCAGAACTAATGCTAGCTTC 


RS2977616 


ACGTTGGATGTTGCACTGGCTAAGAGAAAG 


ACGTTGGATGCCAGAACTAATGCTAGCTTC 


RS2977617 


ACGTTGGATGCCAGAACTAATGCTAGCTTC 


ACGTTGGATGTTGCACTGGCTAAGAGAAAG 


Rs2744626 


ACGTTGGATGACAGTGAAATTGTATTTCCG 


ACGTTGGATGGCACAAACTTAAGAATCTCC 


RS2744625 


ACGTTGGATGAGCAAAATCCACCTATGTCC 


ACGTTGGATGCTGAATTTTGTCTCCAGTAC 


Rs31 15847 


ACGTTGGATGTCGAGGCAGAGGCGTAGTA 


ACGTTGGATGATAG GAATG ACATG AACCCG 


RS2744623 


ACGTTGGATGACGCGAGTCCGTAGGTGCTG 


ACGTTGGATGAAGAGGCTGCTACCCAGAG 


RS4036193 


ACGTTGGATGAGAGCAAGACTCCGTCTCAA 


ACGTTGGATGACATGTCGCTTGATGTGTGC 


RS4036194 


ACGTTGGATGACATGTCGCTTGATGTGTGC 


ACGTTGGATGAGAGCAAGACTCCGTCTCAA 


RS4036196 


ACGTTGGATGCCCCAGCGTTCATATTTGTC 


ACGTTGGATGTCTGGCCAAATGGTCATACC 


rs1042327 


ACGTT6GATGAACTTCACATCACAGCTCCC 


ACGTTGGATGCAGAAGTTGGGTTTTCCAGC 



TABLE 12 



dbSNP 


Extend 


Term 


rs# 


Primer 


Mix 


rs1 474555 


TGAAGCCGACAGTGACACC 


ACT 


rs1 474554 


CCAATTTTGCACACCTCCAGCA 


ACG 


rs10334 


CAGATCCGGAGTGCGTCC 


CGT 


rs10541 


TCTCTCTCAGCCGCAGAA 


CGT 


rs3823299 


GAGGATTTGTGATGAAAATACTA 


ACG 


rs742348 


AATCCCCGTGTTGTTCAAGG 


ACT 


rs1 474644 


AAGGATGTTCATCATAGTGTTTA 


ACG 


rs1 474643 


ACATGTTTATACATACACTCATG 


ACG 


rs2056970 


TTGGCAGCI 1 1 1 lAGGCCTC 


ACT 


rs2223474 


AAGTCTCAAAAAGGTCCC 


ACT 


rs2206284 


TAGGTGGATCCCTTTTCCC 


ACG 


rs756519 


CAGAGCCCTGTTCTTTGATTT 


ACG 


rs756518 


CAAAGGATGCTGTCTGGCC 


ACG 


rs756517 


GTTCCATGAGCGTTTTCTTTG 


ACG 


rs 1474642 


CTTCAGTTTCTTCATCACTTTC 


ACT 


rs2038093 


TTTCACTCTTGTTGCCCAGG 


ACT 


rs2038092 


CCAACATGGTGAAACCCCATCT 


ACT 


rs2223473 


TAGAATTAAAATTAGACTTTGGGG 


ACT 
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dbSNP 


Extend 


Term 








rs760909 


GCAAGCTTAGTTCTAGGTCAG 


CGT 


rs2076319 


TCACAATGCCTTTGTAATGATTT ' 


ACT 


rs3778589 


GIN lAGGAAGACTGCTCTGACAA 


ACG 


rs3800236 


CTGAGAGCCAGCTGCAGTAA 


CGT 


rs2206286 


CCTCGCCGGCTGGCATAA 


ACT 


rs12717 


CCATCCCCAAGTCTCTGCCAG 


ACT 


rs21 79373 


TGACCTATGCTCACACTTCTCA 


ACG 


rs3800235 


GTGTTGATACCTCCAAGTACATTT 


CGT 


rs3823298 


CTTGATGAAATAGTCATCCAACTA 


ACT 


rs2076318 


TGAATTATCACCATCATCA 


ACT 


rs2235506 


TGTTGCCAATAACAATCA 


ACG 


rs2072916 


TGTGACAAGGGATTCCAC 


ACG 


rs3734763 


CATCTGTAAGCAGGGCCGC 


ACG 


rs31 77571 


AAGACTGTGTAGCCTTCCTCTG 


ACT 


rs8770 


GTAG AC ACTGTGTAAG C AATC 


ACG 


rs3173219 


CACTGGAGGTGGAGAGCA 


ACT 


rs960744 


CGCCATCAGACCTGGCTGT 


ACT 


rs2066954 


TTACAATTTGAGCCTTGAGC 


CGT 


rs2072917 


CTATCCCGACCCGAGAAAC 


CGT 


rs31 73220 


GCGATGAAACTGAACTGA 


ACT 


rs734249 


CACCAGCAGGGAAGGTTTG 


CGT 


rs2092310 


TTGAGGTGAGGGCTTGGAG 


ACT 


rs2092309 


TCCCCTCCCCTATTGTTTAC 


ACT 


rs1016536 


AAATTGGAGACAGGTCTCAGT 


ACT 


rs2235506 


CTGGGAGTACAGGTGCGC 


ACT 


rs2076998 


G M 1 1 1 GTATAGTCTGCAGATGG 


ACT 


rs2076997 


ATCCATTTTAATGGGTTG CTAG CT 


ACT 


rs2345478 


ACAAGTGTACTTATTGGGGATA 


ACT 


rs2021899 


GTTTCTTGGAAACTCTTCCCA 


ACT 


rs2021898 


TTGGATGGGGTTAATGGCAG 


ACG 


rs2345682 


GTTAATGGGAGCTGTA 1 1 1 1 1 CTG 


ACT 


rs2345683 


GGCAGCTGTAI 1 1 1 ICTGTGA 


ACT 


rs2881195 


CAGCTGTA 1 1 1 1 1 CTGTGACCT 


ACG 


rs2345684 


GCAGCTGTA 1 1 1 1 1 CTGTGACCTT 


ACT 


rs3046261 


GAAAACATTTGAGATACTGAAGAT 


ACT 


rs4083413 


TTCCTTTATCTTCAGTATCTCAA 


ACT 


rs4083412 


TCTTCAGTATCTGAAATG 1 1 1 ICA 


ACG 


rs2345685 


CAACTTTTGATGTCTGAGTGGA 


ACT 


rs2021897 


ATTATTTACAGAAGCCCTATTCA 


ACT 


rs4036211 


TTTCGAAACAAAAGCTACCATGCA 


ACT 


rs4036212 


AAATAATTGCATGGTAGCTTTTG 


CGT 


rs4036213 


ACAACTACTTTGATGTTATTTCC 


CGT 


rs2345686 


ACAATCCAAAAATCACATTGCTA 


ACT 


rs4036214 


GTCTCACCTTACAATCCAAAAAT 


CGT 
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rs4036215 


AATGTGA 1 1 1 1 1 GGATTGTAAGG 


ACT 


rs2345687 


AAGGTGAGACTTGTTTAGCTTT 


ACT 


rs2345688 


TCCTCCCCATTACAAGTTGGGCA 


ACT 


rs2881196 


TTTTCCTCCCCATTACAAGTTGG 


ACT 


rs3046288 


TAATG G G G AG G AAAAAAATTTTCT 


ACT 


rs4036216 


ATG 1 1 1 1 1 G G AATTCTTAG ATG G 


ACT 


rs4036205 


G 1 1 1 1 1 GGAATTCTTAGATGGGAC 


ACT 


rs2092307 


TGGAATTCTTAGATGGGACCC 


ACT 


rs4036206 


ACTGACAGATTAAAATGAGAAAAA 


ACT 


rs234568g 


TCCCATGTATGCATAAGGTATAC 


ACT 


rs2345690 


GTATCTCCCATGTATCCATAAG 


ACT 


rs2345691 


CCCTGAGTATCTGCCATGTA 


CGT 


rs2345692 


TCTCCAACAGGTGGCTTTCA 


ACT 


rs3046306 


TTGCTGGTTAACTCCCCACT 


CGT 


rs4036207 


GCGTGGACTTAAGTCTGTATAAC 


ACT 


rs2345693 


AGAGTCTTATCAATGGAGAATGC 


ACT 


rs2345694 


GAAGAGAAGGATAACTAAATCACT 


ACT 


rs2345695 


ATTTAGTrATCCTTCTCTTCTTG 


ACT 


rs2345696 


ACACAAGTGTAACTTCTACTCT 


ACT 


rs4036209 


GGAAACCAGAATATGCCACC 


CGT 


rs2345697 


AGCCAAAGGGACATATTTTGTGGT 


ACT 


rs2881197 


GGAACACAGTGTATAAGACCAAA 


CGT 


rs2345698 


CGGTGGAACACAGTGTATAAG 


ACT 


rs2345699 


AAAACAAGAACTCTTTTCATTGCC 


ACT 


rs2744640 


TTTATCTGCAGTTCGCCAGC 


ACG 


rs2744639 


AGCACTTAGTATGCCTTCTCGTT 


ACT 


rs2744638 


TGGCAATACCACGTCAGTAAG 


ACT 


rs2744637 


GCTGGGCTGGGTTTGGGCTG 


ACT 


rs2744636 


ACCCGTCACACTCATATAATCCC 


ACG 


rs2744635 


ACACATGCGTGTTCCAGGG 


ACT 


rs2744634 


GGGATTATATGAGTGTGACGG 


ACT 


rs2744633 


GGGTGTAGCGTTAGGTGAG 


ACT 


rs2744632 


GCGCACATAACGCTTGGAAC 


ACT 


rs2744630 


CGTGTTAAAACTCATGGCCAAAC 


ACT 


rs2744629 


ATAAACGACCCTGGAGTTCAT 


ACT 


rs2744628 


TTGAAGAAAACTTTGGCTTAGCCA 


ACT 


rs2744627 


Gill ATG 1 1 CCAGTGGCTAAG 




rs2977616 


TTG AG GTC AAACATTAATATCAAG 


ACT 


rs2977617 


CTAGCTTCTCAATCTTTTGAGTT 


CGT 


rs2744626 


GTGAAATrGTATTTCCGGATTTC 


ACT 


rs2744625 


TCCTGAACACTTATCCACTTTAC 


ACT 


rs31 15847 


CCAGGGCTGGAGGGGCC 


ACT 


rs2744623 


GGTGCTGGCGGGAGCGAGAGT 


ACT 


rs4036193 


GACTCCGTCTCAAAAAAAAAAAAA 


ACT 
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Primer 


Mix 


rs4036194 


CTTGATGTGTGCTTCAGGGTA 


ACQ 


rs4036196 


CAGTGCAAGTAAAGAGCCTTA 


ACT 


rs1042327 


CATCACAGCTCCCCACCAT 


ACT 



Genetic Analysis 

[0231] Allelotyping results from the discovery cohort are shown for cases and controls in Table 1 3. 
The allele frequency for the A2 allele is noted in the fifth and sixth columns for osteoarthritis case pools 
and control pools, respectively, where "AF" is allele frequency. The allele frequency for the Al allele 
can be easily calculated by subtracting the A2 allele frequency from 1 (Al AF = 1-A2 AF). For example, 
the SNP rsl474555 has the following case and control allele frequencies: case Al (C) = 0.64; case A2 
(T) = 0.36; control Al (C) = 0.70; and control A2 (T) = 0.30, where the nucleotide is provided in 
paranthesis. Some SNPs are labeled "untyped" because of failed assays. 



TABLE 13 



dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


Rs 1474555 


229 


170689279 


C/T 


0.36 


0.30 


0.024 


Rs1 474554 


6310 


170695360 


A/G 


0.48 


0.43 


0.058 


rs10334 


11840 


170700890 


GfT 








rs10541 


11870 


170700920 


ATT 








RS3823299 


12064 


170701114 


A/G 


0.45 


0.41 


0.125 


RS742348 


13392 


170702442 


C/G 


0.46 


0.44 


0.275 


Rs1 474644 


16354 


170705404 


A/G 


0.75 


0.77 


0.270 


Rs1 474643 


16559 


170705609 


C/T 


0.45 


0.40 


0.042 


RS2056970 


16935 


170705985 


A/G 


0.36 


0.33 


0.242 


RS2223474 


17616 


170706666 


C/T 


0.42 


0.46 


0.140 


RS2206284 


17737 


170706787 


C/T 


0.37 


0.35 


0.493 


rs756519 


18321 


170707371 


C/T 








RS756518 


18453 


170707503 


A/G 


0.49 


0.53 


0.133 


RS756517 


18811 


170707861 


C/T 








Rs 1474642 


20020 


170709070 


C/T 


0.12 


0.12 


0.904 


Rs2038093 


21662 


170710712 


C/G 








Rs2038092 


23197 


170712247 


C/G 








RS2223473 


23446 


170712496 


GfT 


0.42 


0.45 


0.296 


RS760909 


24339 


170713389 


GfT 


0.49 


0.52 


0.255 


RS2076319 


25504 


170714554 


A/G 


0.43 


0.46 


0.219 


RS3778589 


27174 


170716224 


A/G 


0.49 


0.54 


0.081 


Rs3800236 


28008 


170717058 


A/T 


0.47 


0.50 


0.319 


RS2206286 


29294 


170718344 


C/T 


0.81 


0.82 


0.831 


rs12717 


29759 


170718809 


C/G 


0.52 


0.57 


0.081 


rs2 179373 


30832 


170719882 


A/G 


0.58 


0.62 


0.089 


rs3800235 


44512 


170733562 


A/C 


0.60 


0.64 


0.077 


rs3823298 


44850 


170733900 


C/G 


0.44 


0.38 


0.022 


rs2076318 


45884 


170734934 


A/G 


0.41 


0.45 


0.109 


rs2235506 


46345 


170735395 


C/T 


0.68 


0.66 


0.320 


rs2072916 


48589 


170737639 


A/G 


0.48 


0.51 


0.192 


rs3734763 


53371 


170742421 


A/G 


0.50 


0.54 


0.142 
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A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F p- 
Value 


rs31 77571 


53911 


170742961 


G/T 








rs8770 


53990 


170743040 


A/G 








rs3173219 


55152 


170744202 


C/G 


0.49 


0.53 


0.056 


rs960744 


55667 


170744717 


C/T 


0.39 


0.35 


0.179 


rs2066954 


58952 


170748002 


A/C 


0.37 


0.32 


0.057 


rs2072917 


59315 


170748365 


GH- 


0.46 


0.42 


0.153 


rs31 73220 


60029 


170749079 


A/G 








rs734249 


61477 


170750527 


A/C 


0.48 


0.40 


0.022 


rs20g2310 


62988 


170752038 


C/T 








rs2092309 


63090 


170752140 


C/G 


0.43 


0.47 


0.165 


rs1016536 


64021 


170753071 


A/C 


0.10 


0.10 


0.985 


rs2235506 


65685 


170754735 


C/T 








rs2076998 


70220 


170759270 


A/G 








rs2076997 


70323 


170759373 


A/C 


0.90 


0.90 


0.814 


rs2345478 


70959 


170760009 


A/C 


0.09 


0.09 


0.947 


rs2021899 


73436 


170762486 


C/G 


0.46 


0.43 


0.218 


rs2021898 


82945 


170771995 


A/G 








rs2345682 


82958 


170772008 


G/T 








rs2345683 


82961 


170772011 


C/G 


0.28 


0.34 


0.019 


rs2881195 


82964 


170772014 


C/T 








rs2345684 


82965 


170772015 


G/T 








rs3046261 


83006 


170772056 


-/CTTT 








rs4083413 


83025 


170772075 


C/T 








rs4083412 


83034 


170772084 


A/G 








rs2345685 


83074 


170772124 


G/T 


0.71 


0.71 


0.835 


rs2021897 


83132 


170772182 


G/T 








rs403621 1 


83155 


170772205 


C/T 








rs4036212 


83172 


170772222 


A/T 








rs4036213 


83174 


170772224 


G/T 








rs2345686 


83206 


170772256 


C/T 








rs4036214 


83216 


170772266 


G/T 








rs4036215 


83234 


170772284 


G/T 








rs2345687 


83252 


170772302 


A/G 


0.55 


0.50 


0.085 


rs2345688 


83260 


170772310 


NC 


0.53 


0.52 


0.958 


rs2881196 


83263 


170772313 


A/C 








rs3046288 


83296 


170772346 


-/AT 








rs4036216 


83319 


170772369 


A/G 








rs4036205 


83322 


170772372 


C/G 








rs2092307 


83324 


170772374 


A/C 








rs4036206 


83357 


170772407 


C/G 








rs2345689 


83375 


170772425 


C/T 








rs2345690 


83381 


170772431 


C/T 








rs2345691 


83389 


170772439 


AfV 








rs2345692 


83443 


170772493 


A/G 








rs3046306 


83499 


170772549 


-/GGTG 


0.42 


0.43 


0.761 


rs4036207 


83545 


170772595 


C/T 








rs2345693 


83566 


170772616 


C/T 








rs2 345694 




170772641 


C/T 








rs2345695 


83619 


170772669 


G/T 








rs2345696 


83698 


170772748 


A/G 








rs4036209 


83780 


170772830 


G/T 


0.79 


0.73 


0.156 


rs2345697 


83784 


170772834 


G/T 








rs2881197 


83826 


170772876 


G/T 








rs2345698 


83832 


170772882 


C/T 








rs2345699 


83852 


170772902 


C/T 








rs2744640 


86297 


170775347 


C/T 


0.53 


0.53 


0.973 


rs2744639 


86315 


170775365 


G/T 


0.40 


0.40 


0.789 
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rs# 


Position 


Allele 


Case AF 


Control AF 


Value 


rs2744638 


86420 


170775470 


C/G 


0.39 


0.39 


0.941 


rs2744637 


86460 


170775510 


C/G 


0.40 


0.42 


0.497 


rs2744636 


86714 


170775764 


CfT 


0.76 


0.73 


0.271 


irs2744635 


86718 


170775768 


CfT 


0.03 


0.02 


0.425 


rs2744634 


86736 


170775786 


C/G 


0.96 


0.94 


0.436 


rs2744633 


86753 


170775803 


CAT 


0.14 


0.16 


0.409 


rs2744632 


86766 


170775816 


G/T 


0.80 


0.83 


0.217 


rs2744630 


88162 


170777212 


C/G 








rs2744629 


88218 


170777268 


A/G 


0.80 


0.80 


0.978 


rs2744628 


88246 


1 70777296 


A/G 


0.71 


0.67 


0.206 


rs2744627 


88255 


1 70777305 


err 


0.32 


0.30 


0.335 


rs2977616 




1 70777359 


G/T 








rs2977617 


88310 


170777360 


An- 








rs2744626 


88471 


170777521 


A/G 








rs2744625 


88619 


170777669 


C/T 








rs31 15847 


88904 


170777954 


C/T 








rs2744623 


89044 


170778094 


C/G 








rs4036193 


90531 


170779581 


-/AAAAA 








rs4036194 


90534 


1 70779584 


A/G 








rs4036196 


90613 


170779663 


C/G 








rs1042327 


46252 


170735302 


C/T 


0.45 


0.39 


0.028 



[0232] The chromosome 6 proximal SNPs were also allelotyped in the replication cohorts using the 
methods described herein and the primers provided in Tables 1 1 and 12. The replication allelotyping 
results for replication cohort #1 and replication cohort #2 are provided in Tables 14 and 15, respectively. 



TABLE 14 



dbSNP 


Position in 
SEQ ID NO: 
1 


Chromosome 


A1/A2 


F A2 


F A2 


Fp- 


rs# 


Position 


Allele 


Case AF 


Control AF 


Value 


rsl 474555 


229 


170689279 


C/T 


0.37 


0.27 


0.004 


rs1 474554 


6310 


170695360 


A/G 


0.50 


0.42 


0.020 


rsl 0334 


11840 


170700890 


G/T 








rsl 0541 


11870 


170700920 


AJT 








rs3823299 


12064 


170701114 


A/G 


0.45 


0.40 


0.080 


rs742348 


13392 


170702442 


C/G 


0.47 


0.41 


0.075 


rsl 474644 


16354 


170705404 


A/G 


0.75 


0.79 


0.231 


rs 1474643 


16559 


1 70705609 


CfT 


0.46 


0.39 


0.028 


rs2056970 


16935 


170705985 


A/G 


0.38 


0.33 


0.129 


rs2223474 


17616 


170706666 


C/T 


0.41 


0.48 


0.052 


rs2206284 


17737 


170706787 


C/T 


0.37 


0.34 


0.342 


rs756519 


18321 


170707371 


C/T 








rs756518 


18453 


170707503 


A/G 


0.48 


0.56 


0.013 


rs756517 


18811 


170707861 


C/T 








rsl 474642 


20020 


170709070 


CfT 


0.10 


0.13 


0.277 


rs2038093 


21662 


170710712 


C/G 








rs2038092 


23197 


170712247 


C/G 








rs2223473 


23446 


170712496 


G/T 


0.42 


0.48 


0.070 


rs760909 


24339 


170713389 


G/T 


0.47 


0.54 


0.077 


rs2076319 


25504 


170714554 


A/G 


0.41 


0.49 


0.017 


rs3778589 


27174 


170716224 


A/G 


0.50 


0.57 


0.035 


rs3800236 


28008 


170717058 


A/T 


0.47 


0.52 


0.126 


rs2206286 


29294 


170718344 


C/T 


0.80 


0.80 


0.952 
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Chromosome 
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A1/A2 
Allele 


F A2 
ase 


F A2 
on ro 


alue 


rs12717 


29759 


170718809 


C/G 


0.53 


0.59 


0.059 


rs2 179373 


30832 


170719882 


A/G 


0.57 


0.64 


0.025 


rs3800235 


44512 


170733562 


A/C 


0.59 


0.65 


0.065 


rs3823298 


44850 


170733900 


C/G 


0.46 


0.36 


0.003 


rs2076318 


45884 


170734934 


A/G 


0.40 


0.47 


0.017 


rs2235506 


46345 


170735395 


err 


0.68 


0.65 


0.434 


rs2072916 


48589 


170737639 


A/G 


0.47 


0.54 


0.026 


rs3734763 


53371 


170742421 


A/G 


0.49 


0.56 


0.052 


rs31 77571 


53911 


170742961 


G/T 








rs8770 


53990 


170743040 


A/G 








rs31 73219 


55152 


170744202 


C/G 


0.49 


0.55 


0.069 


rs960744 


55667 


170744717 


C/T 


0.39 


0.34 


0.131 


rs2066954 


58952 


170748002 


A/C 


0.36 


0.31 


0.096 


rs2072917 


59315 


170748365 


GH- 


0.46 


0.41 


0.070 


rs31 73220 


60029 


170749079 


A/G 








rs734249 


61477 


170750527 


A/C 


0.37 


NA 


0.484 


rs2092310 


62988 


170752038 


C/T 








rs2092309 


63090 


170752140 


C/G 


0.43 


0.49 


0.102 


rs1016536 


64021 


170753071 


A/C 


0.08 


0.11 


0.277 


rs2235506 


65685 


170754735 


C/T 








rs2076998 


70220 


170759270 


A/G 








rs2076997 


70323 


170759373 


A/C 


0.89 


0.91 


0.655 


rs2345478 


70959 


170760009 


A/C 


0.08 


0.09 


0.660 


rs2021899 


73436 


170762486 


C/G 


0.48 


0.42 


0.081 


rs2021898 


82945 


170771995 


A/G 








rs2345682 


82958 


170772008 


G/T 








rs2345683 


82961 


170772011 


C/G 


0.32 


0.39 


0.046 


rs2881195 


82964 


170772014 


C/T 








rs2345684 


82965 


170772015 


G/T 








rs3046261 


83006 


170772056 


-/CTTT 








rs4083413 


83025 


170772075 


C/T 








rs4083412 


83034 


170772084 


A/G 








rs2345685 


83074 


170772124 


G/T 


0.69 


0.70 


0.772 


rs2021897 


83132 


170772182 


G/T 








rs4036211 


83155 


170772205 


C/T 








rs4036212 


83172 


170772222 


A/T 








rs4036213 


83174 


170772224 


G/T 








rs2345686 


83206 


170772256 


C/T 








rs4036214 


83216 


170772266 


G/T 








rs4036215 


83234 


170772284 


G/T 








rs2345687 


83252 


170772302 


A/G 


0.62 


NA 


NA 


rs2345688 


83260 


170772310 


A/C 


0.46 


0.49 


0.383 


rs2881196 


83263 


170772313 


A/C 








rs3046288 


83296 


170772346 


-/AT 








rs4036216 


83319 


170772369 


A/G 








rs4036205 


83322 


170772372 


C/G 








rs2092307 


83324 


170772374 


A/C 








rs4036206 


83357 


1 70772407 


C/G 








rs2345689 


83375 


170772425 


C/T 








rs2345690 


83381 


170772431 


C/T 








rs2345691 


83389 


170772439 


A/T 








rs2345692 


83443 


170772493 


A/G 








rs3046306 


83499 


170772549 


-/GGTG 


0.39 


0.40 


0.729 


rs4Q36207 


83545 


170772595 


C/T 








rs2345693 


83566 


170772616 


C/T 








rs2345694 


83591 


170772641 


C/T 








rs2345695 


83619 


170772669 


G/T 
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A1/A2 

All 

e e 


F A2 
Case A* 


F A2 
Oontrol AF 


b p- 
Value 


rs2345696 


83698 


170772748 


A/G 








rs4036209 


83780 


170772830 


GfT 


0.79 


0.73 


0.156 


rs2345697 


83784 


170772834 


GfT 








rs2881197 


83826 


170772876 


GfT 








rs2345698 


83832 


170772882 


C/T 








rs2345699 


83852 


170772902 


C/T 








rs2744640 


86297 


170775347 


C/T 


0.49 


0.51 


0.583 


rs2744639 


86315 


170775365 


G/T 


0.45 


0.43 


0.745 


rs2744638 


86420 


170775470 


C/G 


0.38 


0.38 


0.852 


rs2744637 


86460 


170775510 


C/G 


0.35 


0.40 


0.216 


rs2744636 


86714 


170775764 


C/T 


0.71 


0.73 


0.482 


rs2744635 


86718 


170775768 


CfT 


0.05 


0.03 


0.195 


rs2744634 


86736 


170775786 


C/G 


0.93 


0.92 


0.601 


rs2744633 


86753 


170775803 


C/T 


0.19 


0.20 


0.681 


rs2744632 


86766 


170775816 


GfT 


0.85 


0.90 


0.070 


rs2744630 


88162 


170777212 


C/G 








rs2744629 


88218 


170777268 


A/G 


0.78 


0.79 


0.891 


rs2744628 


88246 


170777296 


A/G 


0.68 


0.67 


0.766 


rs2744627 


88255 


170777305 


CfT 


0.32 


0.30 


0.636 


rs2977616 


88309 


170777359 


GfT 








rs2977617 


88310 


170777360 


A/r 








rs2744626 


88471 


170777521 


A/G 








rs2744625 


88619 


170777669 


C/T 








rs31 15847 


88904 


170777954 


C/T 








rs2744623 


89044 


170778094 


C/G 








rs4036193 


90531 


170779581 


-/AAAAA 








rs4036194 


90534 


170779584 


A/G 








rs4036196 


90613 


170779663 


C/G 








rs1042327 


46252 


170735302 


C/T 


0.46 


0.37 


0.004 



TABLE 15 



dbSNP 


Position in 
SEQ ID NO: 
1 


Chromosonie 


A1/A2 


F A2 


F A2 


Fp- 


rs# 


Position 


AUele 


Case AF 


Control AF 


Value 


rs 1474555 


229 


170689279 


CfT 


0.35 


0.36 


0.770 


rs 1474554 


6310 


170695360 


A/G 


0.45 


0.44 


0.873 


rs10334 


11840 


170700890 


G/T 








rs10541 


11870 


170700920 


PJT 








rs3823299 


12064 


170701114 


A/G 


untyped 


0.43 


NA 


rs742348 


13392 


170702442 


C/G 


0.45 


0.47 


0.600 


rsl 474644 


16354 


170705404 


A/G 


0.74 


0.75 


0.775 


rsl 474643 


16559 


170705609 


CfT 


0.43 


0.41 


0.614 


rs2056970 


16935 


170705985 


/KfG 


0.33 


0.33 


0.978 


rs2223474 


17616 


170706666 


CfT 


0.44 


0.43 


0.944 


rs2206284 


17737 


170706787 


C/T 


0.36 


0.37 


0.901 


rs756519 


18321 


170707371 


CfT 








rs756518 


18453 


170707503 


A/G 


0.50 


0.47 


0.453 


rs756517 


18811 


170707861 


CfT 








rsl 474642 


20020 


170709070 


CfT 


0.15 


0.11 


0.147 


rs2038093 


21662 


170710712 


C/G 








rs2038092 


23197 


170712247 


C/G 








rs2223473 


23446 


170712496 


GfT 


0.43 


0.40 


0.408 


rs760909 


24339 


170713389 


GfT 


0.51 


0.48 


0.506 


rs2076319 


25504 


170714554 


A/G 


0.44 


0.40 


0.264 


rs3778589 


27174 


170716224 


A/G 


0.49 


0.48 


0.910 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Cliromosome 
Position 


A1/A2 
Allele 


F A2 


F A2 
Control AF 


F p- 
Value 


rs3800236 


28008 


170717058 


A/T 


0.48 


0.46 


0.670 


rs22G6286 


29294 


170718344 


C/T 


0.83 


0.84 


0.685 


rs12717 


29759 


170718809 


C/G 


0.51 


0.53 


0.726 


rs2 179373 


30832 


170719882 


A/G 


0.59 


0.58 


0.880 


rs3800235 


44512 


170733562 


A/C 


0.60 


0.62 


0.632 


rs3823298 


44850 


170733900 


C/G 


0.41 


0.41 


0.945 


rs2076318 


45884 


170734934 


A/G 


0.43 


0.42 


0.636 


rs2235506 


46345 


170735395 


C/T 


0.69 


0.67 


0.594 


rs2072916 


48589 


170737639 


NG 


0.49 


0.46 


0.399 


rs3734763 


53371 


170742421 


NG 


0.51 


0.51 


0.888 


rs31 77571 


53911 


170742961 


G/T 








rs8770 


53990 


170743040 


NG 








rs3173219 


55152 


170744202 


C/G 


0.48 


0.51 


0.493 


rs960744 


55667 


170744717 


C/T 


0.38 


0.37 


0.738 


rs2066954 


58952 


170748002 


A/C 


0.37 


0.34 


0.378 


rs2072917 


59315 


170748365 


G/r 


0.45 


0.45 


0.982 


rs31 73220 


60029 


170749079 


MQ 








rs734249 


61477 


170750527 


NC 


0.46 


0.02 




rs2092310 


62988 


170752038 


C/T 








rs2092309 


63090 


170752140 


C/G 


0.43 


0.44 


0.891 


rs1016536 


64021 


170753071 


A/C 


0.13 


0.09 


0.173 


rs2235506 


65685 


170754735 


C/T 








rs2076998 


70220 


170759270 


NG 








rs2076997 


70323 


170759373 


NC 


0.92 


0.89 


0.256 


rs2345478 


70959 


170760009 


NC 


0.11 


0.10 


0.545 


rs2021899 


73436 


170762486 


C/G 


0.44 


0.45 


0.797 


rs2021898 


82945 


170771995 


NG 








rs2345682 


82958 


170772008 


G/T 








rs2345683 


82961 


170772011 


C/G 


0.23 


0.26 


0.407 


rs2881195 


82964 


170772014 


C/T 








rs2345684 


82965 


170772015 


G/T 








rs3046261 


83006 


170772056 


-/CTTT 








rs4083413 


83025 


170772075 


C/T 








rs4083412 


83034 


170772084 


A/G 








rs2345685 


83074 


170772124 


G/T 


0.74 


0.71 


0.533 


rs2021897 


83132 


170772182 


G/T 








rs4036211 


83155 


170772205 


C/T 








rs4036212 


83172 


1 70772222 


PJT 








rs4036213 


83174 


170772224 


G/T 








rs2345686 


83206 


170772256 


C/T 








rs4036214 


83216 


170772266 


G/T 








rs4036215 


83234 


170772284 


G/T 








rs2345687 


83252 


170772302 


A/G 


0.47 


0.50 


0.457 


rs2345688 


83260 


170772310 


NC 


0.61 


0.58 


0.434 


rs2881 1 96 


83263 


170772313 


NC 








rs3046288 


83296 


170772346 


-/AT 








rs4036216 


83319 


170772369 


NG 








rs4036205 




170772372 


C/G 








rs2092307 


83324 


170772374 


A/C 








rs4036206 


83357 


170772407 


C/G 








rs2345689 


83375 


170772425 


C/T 








rs2345690 


83381 


170772431 


C/T 








rs2345691 


83389 


170772439 


NT 








rs2345692 


83443 


170772493 


NG 








rs3046306 


83499 


170772549 


-/GGTG 








rs4036207 


83545 


170772595 


C/T 








rs2345693 


83566 


170772616 


C/T 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F p- 
Value 


rs2345694 


83591 


170772641 


CAT 








rs2345695 


83619 


170772669 


GfT 








rs23456g6 


83698 


170772748 


A/G 








rs4036209 


83780 


170772830 


GfT 








rs2345697 


83784 


170772834 


GfT 








rs2881197 


83826 


170772876 


GfT 








rs2345698 


83832 


1 70772882 


CfT 








rs2345699 


83852 


170772902 


CfT 








rs274464G 


86297 


170775347 


CfT 


0.57 


0.55 


0.595 


rs2744639 


86315 


170775365 


GfT 


0.35 


0.34 


0.752 


rs2744638 


86420 


170775470 


C/G 


0.41 


0.40 


0.793 


rs2744637 


86460 


170775510 


C/G 


0.47 


0.46 


0.836 


rs2744636 


86714 


170775764 


CfT 


0.83 


NA 




rs2744635 


86718 


170775768 


CfT 








rs2744634 


86736 


1 70775786 


C/G 


untyped 


0.97 


NA 


rs2744633 


86753 


1 70775803 


CfT 


0.09 


0.10 


0.691 


rs2744632 


86766 


170775816 


GfT 


0.74 


0.72 


0.529 


rs2744630 


88162 


170777212 


C/G 








rs2744629 


88218 


170777268 


AJG 


0.81 


0.81 


0.959 


rs2744628 


88246 


170777296 


AJG 


0.74 


NA 




rs2744627 


88255 


170777305 


CfT 


0.33 


0.29 


0.341 


rs2977616 


88309 


1 70777359 


GfT 








rs2977617 


88310 


170777360 


A/T 








rs2744626 


88471 


170777521 


A/G 








rs2744625 


88619 


170777669 


CfT 








rs31 15847 


88904 


170777954 


CfT 








rs2744623 


89044 


170778094 


C/G 








rs4036193 


90531 


1 70779581 


./AAAAA 








rs4036194 


90534 


1 70779584 


NQ 








rs4036196 


90613 


1 70779663 


C/G 








rs1042327 


46252 


1 70735302 


CfT 


0.42 


0.43 


0.880 



[0233] Allelotyping results were considered particularly significant with a calculated p-value of less 
than or equal to 0.05 for allelotype results. These values are indicated in bold. The allelotyping p-values 
were plotted in Figure 1 A for the discovery cohort. The position of each SNP on the chromosome is 
presented on the x-axis. The y-axis gives the negative logarithm (base 10) of the p-value comparing the 
estimated allele in the case group to that of the control group. The minor allele frequency of the control 
group for each SNP designated by an X or other symbol on the graphs in Figure 1 A can be determined by 
consulting Table 13. For example, the left-most X on the left graph is at position 170689279. By 
proceeding down the Table from top to bottom and across the graphs from left to right the allele 
frequency associated with each symbol shown can be determined. 

[0234] To aid the interpretation, multiple lines have been added to the graph. The broken horizontal 
lines are drawn at two common significance levels, 0.05 and 0.01 . The vertical broken lines are drawn 
every 20kb to assist in the interpretation of distances between SNPs. Two other lines are drawn to 
expose linear trends in the association of SNPs to the disease. The light gray line (or generally bottom- 
most curve) is a nonlinear smoother through the data points on the graph using a local polynomial 
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regression method (W.S. Cleveland, E. Grosse and W.M. Shyu (1992) Local regression models. Chapter 
8 of Statistical Models in S eds J.M. Chambers and T.J. Hastie, Wadsworth & Brooks/Cole.). The black 
line provides a local test for excess statistical significance to identify regions of association. This was 
created by use of a lOkb sliding window with Ikb step sizes. Within each window, a chi-square 
goodness of fit test was applied to compare the proportion of SNPs that were significant at a test wise 
level of 0.01 , to the proportion that would be expected by chance alone (0.05 for the methods used here). 
Resulting p- values that were less than 10"^ were truncated at that value. 

[0235] Finally, the exons and introns of the genes in the covered region are plotted below each 
graph at the appropriate chromosomal positions. The gene boundary is indicated by the broken 
horizontal line. The exon positions are shown as thick, unbroken bars. An arrow is place at the 3' end of 
each gene to show the direction of transcription. 

Example 5 
ELP3 Region Proximal SNPs 

[0236] It has been discovered that SNP rs 1563055 in elongation protein 3 homolog {ELP3) is 
associated with occurrence of osteoarthritis in subjects. 

[0237] Thirty-three additional allelic variants proximal to rsl 563055 were identified and 
subsequently allelotyped in osteoarthritis case and control sample sets as described in Examples 1 and 2. 
The polymorphic variants are set forth in Table 16. The chromosome positions provided in column four 
of Table 16 are based on Genome "Build 34" of NCBI's GenBank. 



TABLE 16 



dbSNP 
rs# 


Chromosome 


Position in SEQ 
ID NO: 2 


Chromosome 
Position 


Allele 
Variants 


rs1 000658 


8 


211 


27927511 


C/t 


rs1 984880 


8 


473 


27927773 


c/t 


rsgg9112 


8 


1536 


27928836 


c/t 


rs735880 


8 


5639 


27932939 


c/t 


rs2045029 


8 


17186 


27944486 


a/a 


rs2045028 


8 


17335 


27944635 


c/t 


rs1 947384 


8 


25029 


27952329 


c/g 


rs1 947385 


8 


25111 


27952411 


c/t 


rs1901744 


8 


28811 


27956111 


a/a 


rs1901745 


8 


28863 


27956163 


a/t 


rs971882 


8 


30809 


27958109 


a/c 


rs1 377338 


8 


40985 


27968285 


a/c 


rs2305452 


8 


45147 


27972447 


c/t 


rs2305451 


8 


45282 


27972582 


a/g 


rs2 123472 


8 


46168 


27973468 


fl/t 


rs21 67768 


8 


46328 


27973628 


a/q 
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dbSNP 
rs# 


C b romosonie 


Position in SEQ 
ID NO: 2 


Clironiosome 
Position 


Allele 
Variants 


rs1563055 


8 


49077 


27976377 


a/Q 


rs2290371 


8 


51925 


27979225 


c/t 


rs2290370 


8 


52141 


27979441 


a/q 


rs22g036g 


8 


52168 


27979468 


c/t 


rs2874904 


8 


60852 


27988152 


dt 


rs3213997 


8 


62468 


27989768 


a/q 


rs3213998 


8 


65572 


27992872 


fl/t 


rs1 530929 


8 


79089 


28006389 


a/c 


rs1 000275 


8 


79541 


28006841 


c/t 


rs1 000274 


8 


79790 


28007090 


c/t 


rs3757896 


8 


90843 


28018143 


a/g 


rs3757895 


8 


90978 


28018278 


c/t 


rs3757894 


8 


91052 


28018352 


c/q 


rs3757893 


8 


91131 


28018431 


a/q 


rs3757892 


8 


91132 


28018432 


c/t 


rs3757891 


8 


94439 


28021739 


a/q 


rs3757890 


8 


94621 


28021921 


a/t 



Assay for Verifying and Allelotyping SNPs 

[0238] The methods used to verify and allelotype the 33 proximal SNPs of Table 16 are the same 
methods described in Examples 1 and 2 herein. The primers and probes used in these assays are provided 
in Table 17 and Table 18, respectively. 



TABLE 17 



dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


Rsl 000658 


ACGTTGGATGTTCTCAAAAAAGAAACACAT 


ACGTTGGATGGGGTTATCAGTTTGAGATTC 


Rs 1984880 


ACGTTGGATGCCATTTGCCAATTCCTGTGG 


AGGTTGGATGATGGGCTGAAATGTATCCCC 


rs999112 


AGGTTGGATGCTAAGCACATGCCTTTCTTG 


ACGTTGGATGCTA1TTTCTACTGGGAGATG 


rs735880 


ACGTTGGATGTGCCTTCATTCTCCAACCAC 


ACGTTGGATGAACAGAGTGAGACCCATCTG 


RS2045029 


ACGTTGGATGAGTCATTGCTAGCTTTCTGG 


ACGTTGGATGGGGACTTTAGGGAAGTTATAG 


RS2045028 


ACGTTGGATGAGGTTGTAGTGAGCCGAGAT 


ACGTTGGATGTGAGACAGAGTCTTGCTCTG 


Rsl 947384 


ACGTTGGATGATTCTCCACCGAGAAACCAG 


ACGTTGGATGTTGTGGCAGCAAGAAGGAAC 


Rsl 947385 


ACGTTGGATGAAATTTCAACAGTCAACAAT 


ACGTTGGATGGTCAGTTTTGAAAACTGATC 


Rsl 901 744 


AGGTTGGATGCCTTGATTGAAGAGTAAAGG 


ACGTTGGATGATC/XAATATTCCTCATCCCC 


Rsl 901 745 


ACGTTGGATGCTTCTGGCTTTACCTGTGTG 


ACGTTGGATGAAATGAAGCAGCACTCACAG 


rs971882 


ACGTTGGATGAAGCCCTAATCATTGGTACG 


ACGTTGGATGGATGGGTGCTAAAAAGACAC 


Rs1 377338 


ACGTTGGATGCCCACATATCTACACATCAAG 


ACGTTGGATGAGGGAGATAGGTGGTTAAAG 


RS2305452 


ACGTTGGATGCCGTGTTGCAACTAACAGGG 


ACGTTGGATGAGACGTTCCCATCCTCCATC 


Rs2305451 


ACGTTGGATGGCAGAGCCACCAGAGATAAA 


ACGTTGGATGTTTTACGACAGGCGGGATTG 


Rs2 123472 


ACGTTGGATGCACTTAGAATTGTTGCTTGG 


ACGTTGGATGGCTGTATCTGTGACCTCAAA 


Rs2 167768 


ACGTTGGATGG AATC AACATGACTTG GTGAC 


ACGTTGGATGATCTCACTCTAACTTGCTCC 


rsl 563055 


ACGTTGGATGAGTTCTTTCTCCTCACATTG 


AGGTTGG ATG CCCTTTAG AAGCACATACTC 


Rs2290371 


ACGTTGGATGATCCTCTTGGTAGCTTGTCC 


ACGTTGGATGCTGTCTTGGTTTTCACCCTG 
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dbSNP 
rs# 


Forward 
PGR primer 


Reverse 
PCR primer 


RS2290370 


ACGTTGGATGCAACCTCTACCTCACTACAC 


ACGTTGGATGATGAGGTATCGACACACTGG 


Rs2290369 


ACGTTGGATGACACACTGGGTATCTGTTCT 


ACGTTGGATGTCAGAATCCCCAACCTCTAC 


RS2874904 


ACGTTGGATGAAATTCCAGGCTGGGTACAG 


ACGTTGGATGTGGTGACCTTAAGTGATGCG 


Rs32 13997 


ACGTTGGATGGGTTGGCTAGAAGAGAAAAA 


ACGTTGGATGTACAGTCCTTTTGAAACTAC 


Rs32 13998 


ACGTTGGATGACAGTTTGTTGACATAGTAG 


ACGTTGGATGAGGCTGAAAAGACATTCATG 


Rsl 530929 


ACGTTGGATGGGCTTTCACTATATTTCCTC 


ACGTTGGATGGAATACAGTAAGCCTATGGG 


Rsl 000275 


ACGTTGGATGAACCCCAGAAAGCAAAAAGC 


ACGTTGGATGGACGCTTGCTAACTTAATGG 


Rsl 000274 


ACGTTGGATGGCCTAAGACAGGATCCAAAC 


ACGTTGGATGTTACTGCGTGCCTTAGTACC 


Rs3757896 


ACGTTGGATGCCTTCAAGCAAGTCAGTTAC 


ACGTTGGATGCAGAAACTGTGTGACTGATC 


RS3757895 


ACGTTGGATGAAAATCATTGGCCAAACTGC 


ACGTTGGATGCTCCTTAGTATTCTTAGGTG 


RS3757894 


ACGTTGGATGAGAAGGGTTGAACAACAAGG 


ACGTTGGATGCACCTAAGAATACTAAGGAG 


RS3757893 


ACGTTGGATGCCCTTGTTGTTCAACCCTTC 


ACGTTGGATGCTGCATGTGGATACCTACAC 


RS3757892 


ACGTTGGATGTCCTGCATGTGGATACCTAC 


ACGTTGGATGCCCTTGTTGTTCAACCCTTC 


RS3757891 


ACGTTGGATGATGGGCCAATTCTCCATAGG 


ACGTTGGATGAGGCCTGTTAAGGAAACCTG 


Rs3757890 


ACGTTGGATGCAGGTGGATGTAGGCTTAAG 


ACGTTGGATGGCACCACTGCCTCTTGTTTT 



TABLE 18 



dbSNP 


Extend 


Term 


rs# 


Primer 


Mix 


rsl 000658 


AATTGACAATGTTGGGACTGTT 


ACG 


rs1 984880 


TGTGGTGTAAATAGGAGTTAGTGG 


ACT 


rs999112 


GCACATGCCTTTCTTGGAACTG 


ACG 


rs735880 


AACCTTTACTTGTACTACATGC 


ACG 


rs2045029 


GCTAGCTTTCTGGTAATGAAAAT 


ACT 


rs2045028 


GATCGCACCACTGCACTCCAG 


ACG 


rsl 947384 


ATAGCGGCAGTCCAAAAAGC 


ACT 


rsl 947385 


TTCAACAGTCAACAATGAAACC 


ACT 


rsl 901 744 


ATAGTCAAGTATGCAAATGAAGC 


ACT 


rsl 901 745 


CCTTTACCTGTGTCTTCCCT 


CGT 


rs971882 


CCTAATCATTGGTACGGTCTCA 


ACT 


rsl 377338 


AGTATTAGCTCAAATATCACATTG 


ACT 


rs2 305452 


CAGGGTAGCAGGCGGCC 


ACG 


rs2305451 


CCACAAACTCAGACCACGG 


ACT 


rs2 123472 


CAGTTAATGTCAAGAAGCATAG 


ACT 


rs2 167768 


ACATGACTTGGTGACAGAAGAA 


ACT 


rs 1563055 


TTCTCCTCACATTGTTTCTACT 


ACG 


rs2290371 


GGTAGCTTGTCCTTAAATAACCGT 


ACT 


rs2290370 


GGAGCAGGGACTTCTGCCA 


ACT 


rs2290369 


AGTCCCTGCTCCATGTGAC 


ACT 


rs2874904 


GGCTAACGCCTGTAATCCCA 


ACT 


rs3213997 


AGAAAAATATTGTTATGCCCACA 


ACG 


rs32139g8 


TAGTATTCTCAAATAGAGAGATTC 


ACT 


rsl 530929 


TTTCCTCTTTCCAGAATTGTATTT 


ACT 


rs1 000275 


ATGAGAATATCCTAGAATGAGGCA 


ACG 
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dbSNP 


Extend 


Term 








rs1 000274 


GAATCATCAGGTCCTGTGCC 


ACG 


rs3757896 


TAATTCTCCTTAAGTAGTTAATTC 


ACT 


rs3757895 


TTGGCCAAACTGCAGGATCT 


ACT 


rs3757894 


AAGGGCCACACAAGCAATTTCAA 


ACT 


rs3757893 


CCAAAGGACATTAGGTGGTG 


ACG 


rs3757892 


TGTGGATACCTACACTGCTC 


ACG 


rs3757891 


AGGATAAGTGTAACGGGGTC 


ACT 


rs3757890 


AGTGACACTCTTACTTCACAC 


CGT 



Genetic Analysis 

[0239] Allelotyping results from the discovery cohort are shown for cases and controls in Table 19. 
The allele frequency for the A2 allele is noted in the fifth and sixth columns for osteoarthritis case pools 
and control pools, respectively, where "AF" is allele frequency. The allele frequency for the Al allele 
can be easily calculated by subtracting the A2 allele frequency from 1 (Al AF = 1-A2 AF). For example, 
the SNP rsl000658 has the following case and control allele frequencies: case Al (C) = 0.36; case A2 
(T) = 0.64; control Al (C) = 0.37; and control A2 (T) = 0.63, where the nucleotide is provided in 
paranthesis. Some SNPs are labeled "untyped" because of failed assays. 



TABLE 19 



dbSNP 
rs# 


Position in 
SEQ ID NO: 
2 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


Rs1 000658 


211 


27927511 


C/T 


0.79 


0.80 


0.591 


Rs1 984880 


473 


27927773 


C/J 


0.47 


0.48 


0.735 


RS999112 


1536 


27928836 


err 


0.72 


0.72 


0.775 


RS735880 


5639 


27932939 


cn 


0.20 


0.19 


0.561 


RS2045029 


17186 


27944486 


AIG 


0.54 


0.56 


0.361 


RS2045028 


17335 


27944635 


C/T 








Rs1 947384 


25029 


27952329 


C/G 


0.63 


0.60 


0.122 


Rs 1947385 


25111 


2795241 1 


C/T 








Rs1901744 


28811 


27956111 


A/G 


0.18 


0.18 


0.796 


Rs1901745 


28863 


27956163 


ATT 


0.14 


0.18 


0.117 


RS971882 


30809 


27958109 


A/C 








Rs1 377338 


40985 


27968285 


A/C 


0.28 


0.24 


0.085 


Rs2305452 


45147 


27972447 


C/T 


0.31 


0.27 


0.078 


RS2305451 


45282 


27972582 


A/G 


0.48 


0.52 


0.130 


Rs2 123472 


46168 


27973468 


GIT 


0.42 


0.45 


0.239 


Rs2 167768 


46328 


27973628 


A/G 


0.38 


0.35 


0.350 


Rs1 563055 


49077 


27976377 


A/G 








RS2290371 


51925 


27979225 


CAT 


0.28 


0.24 


0.039 


RS2290370 


52141 


27979441 


A/G 


0.85 


0.84 


0.551 


RS2290369 


52168 


27979468 


C/T 


0.43 


0.47 


0.138 


RS2874904 


60852 


27988152 


C/T 


0.26 


0.23 


0.132 


Rs32 13997 


62468 


27989768 


A/G 


0.44 


0.47 


0.201 


RS3213998 


65572 


27992872 


Gn- 


0.83 


0.80 


0.223 


Rs1 530929 


79089 


28006389 


A/C 


0.47 


0.49 


0.556 
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dbSNP 


Position in 
SEQroNO: 
2 


Chromosome 


A1/A2 


F A2 


F A2 




rs# 


Position 


AUele 


Case AF 


Control AF 


Value 


Rs1 000275 


79541 


28006841 


C/T 


0.86 


0.87 


0.771 


RS1000274 


79790 


28007090 


err 


0.54 


0.56 


0.510 


RS3757896 


90843 


28018143 


AJG 








RS3757895 


90978 


28018278 


CfT 


0.46 


0.47 


0.874 


RS3757894 


91052 


28018352 


C/G 


0.08 


0.09 


0.709 


RS3757893 


91131 


28018431 


A/G 


0.16 


0.15 


0.590 


RS3757892 


91132 


28018432 


CAT 


0.09 


0.08 


0.595 


Rs3757891 


94439 


28021739 


A/G 








RS3757890 


94621 


28021921 


An 


0.98 


0.96 


0.167 



[0240] The ELP3 proximal SNPs were also allelotyped in the replication cohorts using the methods 
described herein and the primers provided in Tables 17 and 18. The replication allelotyping results for 
replication cohort #1 and replication cohort #2 are provided in Tables 20 and 21 , respectively. 



TABLE 20 



dbSNP 
rs# 


Position in 
SEQ ID NO: 




Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 


rs 1000658 




2792751 1 


C/T 


0.78 


0.79 


0.863 


rs1 984880 





0700777*5 










rs999112 


1536 


27928836 


CfT 


0.71 


0.70 


0.759 


rs735880 


5639 


27932939 


C/T 


0.20 


0.17 


0.255 


rs2045029 


17186 


27944486 


A/G 


0.55 


0.57 


0.526 


rs2045028 


17335 


27944635 


CfT 








rsl 947384 


25029 


27952329 


C/G 


0.65 


0.61 


0.198 


rs1 947385 


25111 


2795241 1 


C/T 








rs1901744 


28811 


279561 1 1 


A/G 


0.19 


0.18 


0.674 


rs1901745 


28863 


27956163 


A/T 


0.15 


0.18 


0.448 


rs971882 


30809 


27958109 


NC 








rsl 377338 


40985 


27968285 


NC 


0.29 


0.22 


0.039 


rs2305452 


45147 


27972447 


CfT 


0.31 


0.26 


0.067 


rs2305451 


45282 


27972582 


A/G 


0.49 


0.56 


0.063 


rs2 123472 


46168 


27973468 


G/T 


0.42 


0.49 


0.039 


rs2 167768 


46328 


27973628 


A/G 


0.36 


0.34 


0.396 


rs 1563055 


49077 


27976377 


A/G 








rs2290371 


51925 


27979225 


cn- 


0.28 


0.23 


0.054 


rs2290370 


52141 


27979441 


A/G 


0.85 


0.83 


0.488 


rs2290369 


52168 


27979468 


CfT 


0.41 


0.49 


0.036 


rs2874904 


60852 


27988152 


C/T 


0.29 


0.22 


0.062 


rs32 13997 


62468 


27989768 


A/G 


0.44 


0.50 


0.064 


rs32 13998 


65572 


27992872 


G/T 


0.84 


0.82 


0.336 


rsl 530929 


79089 


28006389 


AfC 


0.48 


0.52 


0.311 


rsl 000275 


79541 


28006841 


CfT 


0.86 


0.87 


0.566 


rs1 000274 


79790 


28007090 


C/T 


0.54 


0.59 


0.159 


rs3757896 


90843 


28018143 


A/G 








rs3757895 


90978 


28018278 


C/T 


0.45 


0.49 


0.308 


rs3757894 


91052 


28018352 


C/G 


0.09 


0.09 


0.914 


rs3757893 


91131 


28018431 


/K/G 


0.15 


0.14 


0.803 


rs3757892 


91132 


28018432 


CfT 


0.09 


0.08 


0.798 


rs3757891 


94439 


28021739 


A/G 








rs3757890 


94621 


28021921 


AfT 


0.98 


0.95 


0.159 
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TABLE 21 



dbSNP 
rs# 




Position in 
SEQ ID NO: 

? 


Chromosome 
Position 


A1/A2 
AUele 


FA2 
Case AF 


F A2 
Control AF 


Fp- 


rs1 000658 


211 


2792751 1 


CAT 


— 2i§Q — 


0.82 


0.443 


rs1 984880 


47J 


27927773 


C/T 


0A8 


0^47 


— 0.898 — 


rs9991 1 2 


1536 


27928836 


C/T 





0J6 


0.319 


rs735880 


5639 


27932939 


C/T 


0^20 


0,22 


— 0.598 — 


rs2045029 


1 /lob 


27944486 


A/G 


0^52 


0,54 


— 0.581 — 


rs2045028 


oln^Q 


27944635 











rsiw f siyt 




27952329 







032 




0,59 




— 0.348 — 


rs 1947385 


25111 


2795241 1 


C/T 








rs1901744 


28811 


279561 1 1 


A/G 


0/\8 


0.18 


0.928 


rs1901745 


28863 


279561 63 


A/T 


0J3 


0.17 


0.113 


rs971 882 


30809 


279581 09 


fiJC 








rsl 377338 


40985 


27968285 


A/C 


0^27 


027 


— 0.961 


rs2305452 


I^OQ^ 


27972447 


C/T 


0^32 


0,30 


— 0.673 — 


rs2305451 


45282 


27972582 


A/G 


0A7 


0A7 


— 0.911 


rs2 123472 


46168 


27973468 


G/T 


0^41 


0.38 


0.348 


rs2 167768 


46328 


27973628 


A/G 


039 


0.37 


0.664 


rs1 563055 


49077 


27976377 


A/G 








rs2290371 


51925 


27979225 


C/T 


0^28 


0,25 


— 0.403 — 


rs2290370 


52141 


27979441 


A/G 


0;85 


0,84 


0.939 


rs2290369 


52168 


27979468 


C/T 


0.46 


0A4 


0.712 


rs2874904 




27988152 


C/T 






0.888 


rs32 1 3997 


62468 


27989768 




045 


043 




rs32 13998 


65572 


27992872 


G/T 


0.81 


0.78 


0.373 


rsl 530929 


79089 


28006389 


A/C 


0.46 


0.43 


0.445 


rsl 000275 


79541 


28006841 


C/T 


0.87 


0.86 


0.767 


rsl 000274 


79790 


28007090 


C/T 


0.54 


0.51 


0.394 


rs3757896 


90843 


28018143 


A/G 








rs3757895 


90978 


28018278 


C/T 


0.47 


0.42 


0.202 


rs3757894 


91052 


28018352 


C/G 


0.07 


0.09 


0.478 


rs3757893 


91131 


28018431 


A/G 


0.17 


0.16 


0.653 


rs3757892 


91132 


28018432 


C/T 


0.09 


0.07 


0.567 


rs3757891 


94439 


28021739 


A/G 








rs3757890 


94621 


28021921 


A/T 


0.97 


0.97 


0.728 



[0241] Allelotyping results were considered particularly significant with a calculated p-value of less 
than or equal to 0.05 for allelotype results. These values are indicated in bold. The allelotyping p-values 
were plotted in Figure IB for the discovery cohort. The position of each SNP on the chromosome is 
presented on the x-axis. The y-axis gives the negative logarithm (base 10) of the p-value comparing the 
estimated allele in the case group to that of the control group. The minor allele frequency of the control 
group for each SNP designated by an X or other symbol on the graphs in Figure IB can be determined by 
consulting Table 19. For example, the left-most X on the left graph is at position 27927511. By 
proceeding down the Table from top to bottom and across the graphs from left to right the allele 
frequency associated with each symbol shown can be determined. 

[0242] To aid the interpretation, multiple lines have been added to the graph. The broken horizontal 
lines are drawn at two common significance levels, 0.05 and 0.01 . The vertical broken lines are drawn 
every 20kb to assist in the interpretation of distances between SNPs. Two other lines are drawn to 
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expose linear trends in the association of SNPs to the disease. The light gray line (or generally bottom- 
most curve) is a nonlinear smoother through the data points on the graph using a local poljmomial 
regression method (W.S. Cleveland, E. Grosse and W.M. Shyu (1992) Local regression models. Chapter 
8 of Statistical Models m S eds J.M. Chambers and T.J. Hastie, Wadsworth & Brooks/Cole.). The black 
line provides a local test for excess statistical significance to identify regions of association. This was 
created by use of a lOkb sliding window with Ikb step sizes. Within each window, a chi-square 
goodness of fit test was applied to compare the proportion of SNPs that were significant at a test wise 
level of 0.01, to the proportion that would be expected by chance alone (0.05 for the methods used here). 
Resulting p-values that were less than 10 * were truncated at that value. 

[0243] Finally, the exons and introns of the genes in the covered region are plotted below each 
graph at the appropriate chromosomal positions. The gene boundary is indicated by the broken 
horizontal line. The exon positions are shown as thick, unbroken bars. An arrow is place at the 3' end of 
each gene to show the direction of transcription. 

Examplg $ 
CHDCl Region Proximal SNPs 

[02441 It has been discovered that SNP rs912428 in calponin homology (CH) domain containing 1 
{CHDCl) is associated with occurrence of osteoarthritis in subjects. 

[0245] Forty-three additional allelic variants proximal to rs9 12428 were identified and subsequently 
allelotyped in osteoarthritis case and control sample sets as described in Examples 1 and 2. The 
polymorphic variants are set forth in Table 22. The chromosome positions provided in column four of 
Table 22 are based on Genome "Build 34" of NCBI's GenBank. 



TABLE 22 



dbSNP 
rs# 


Chromosome 


Position in SEQ 
ID NO: 3 


Chromosome 
Position 


Allele 
Variants 


rs1012628 


13 


243 


44917643 


C/t 


rs1 570976 


13 


10208 


44927608 


c/t 


rs912436 


13 


15049 


44932449 


C/t 


rs912435 


13 


15111 


4493251 1 


a/fl 


rs912433 


13 


15272 


44932672 


c/t 


rs9 12432 


13 


15287 


44932687 


a/q 


rs912431 


13 


15326 


44932726 


a/g 


rs9 12430 


13 


15327 


44932727 


c/t 


rsl 408225 


13 


17038 


44934438 


c/t 


rs998657 


13 


19391 


44936791 


a/q 


rsl 324006 


13 


21702 


44939102 


c/t 


rsl 92441 7 


13 


22431 


44939831 


c/a 


rs2038728 


13 


22881 


44940281 


a/g 
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dbSNP 
rs# 


Chromosome 


Position in SCQ 
roNO:3 


Chromosome 
Position 


Allele 
Variants 


rs9 12429 


13 


27744 


44945144 


a/t 


rs3742269 


13 


32564 


44949964 


a/Q 


rs3742270 


13 


32698 


44950098 


a/c 


rs3803192 


13 


33104 


44950504 


q/t 


rs3803191 


13 


33181 


44950581 


c/t 


rs754106 


13 


33256 


44950656 


c/t 


rs2005053 


13 


33543 


44950943 


c/t 


rs1 535793 


13 


35567 


44952967 


c/t 


rs1 886220 


13 


40085 


44957485 


c/t 


rs1886219 


13 


40482 


44957882 


a/t 


rs1 535792 


13 


45641 


44963041 


a/t 


rsl 535791 


13 


46059 


44963459 


a/g 


rs912428 


13 


48504 


44965904 


c/t 


rs1886218 


13 


48919 


44966319 


a/c 


rsl 570622 


13 


49693 


44967093 


c/t 


rs9 12427 


13 


49874 


44967274 


a/q 


rs912426 


13 


50020 


44967420 


a/q 


rs3068693 


13 


50616 


44968016 


-/ttt 


rs1 570621 


13 


50719 


44968119 


a/q 


rsl 886965 


13 


55511 


4497291 1 


c/t 


rsl 008849 


13 


65533 


44982933 


a/q 


rs9 12434 


13 


70529 


44987929 


a/c 


rs3889095 


13 


75591 


44992991 


c/t 


rs716223 


13 


77266 


44994666 


fl/t 


rs2897207 


13 


80368 


44997768 


fl/t 


rsl 570620 


13 


82475 


44999875 


a/q 


rsl 467605 


13 


92462 


45009862 


q/t 


rsl 467604 


13 


92480 


45009880 


c/t 


rsl 408224 


13 


95819 


45013219 


c/t 


rs1 408223 


13 


96275 


45013675 


c/t 



Assay for Verifying and Allelotvping SNPs 

[0246] The methods used to yerify and allelotype the 43 proximal SNPs of Table 22 are the same 
methods described in Examples 1 and 2 herein. The primers and probes used in these assays are provided 
in Table 23 and Table 24, respectively. 



TABLE 23 



dbSNP 


Forward 
PGR primer 


Reverse 
PCR primer 


Rsl 01 2628 


ACGTTGGATGGATTTTCTGTGTCCCCCAAG 


ACGTTGGATGTTGCAACAGAGAGAGCTCTG 


Rsl 570976 


ACGTTGGATGTGATGTGTCTGCTGTGTTGG 


ACGTTGGATGTTCACATGGCGAGGTCTTAG 


rs912436 


ACGTTGGATGCCATATAAGGTGGTTATGGG 


ACGTTGGATGCAAACAGGTTTTTCTGAGGC 


rs912435 


ACGTTGGATGCAAGCCAATATCCAAGACAG 


ACGTTGGATGAAAAACCTGTTTGTGAGGCC 
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dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs912433 


ACGTTGGATGTGCCTTCCATCCTTAACACG 


ACGTTGGATGGGCTTGAGCTTAGATATGGG 


rs912432 


ACGTTGGATGAAATAGTTGGGTTTTGTGCC 


AGGTTGGATGATTTGGTGTTAATTGCAGTG 


rs912431 


ACGTTGGATGTGGAAGGCACAAAACCCAAC 


ACGTTGGATGCAGAAGCTAGGCTTCCTATG 


rs912430 


ACGTTGGATGTGGAAGGCACAAAACCCAAC 


ACGTTGGATGCAGAAGCTAGGCTTCCTATG 


Rs1 408225 


ACGTTGGATGGGGCACCATGACAATATTCC 


ACGTTGGATGACACCTTGATCTTGGACTTC 


rs998657 


ACGTTGGATGACTGGGCCAGGGAGGAATAG 


ACGTTGGATGGTTGGGGAGATAATACAGAAG 


Rs1 324006 


ACGTTGGATGGCTGAAAACCCAAATGTGTG 


ACGTTGGATGCCAGCTATCAGCTCCATTTC 


RS1924417 


ACGTTGGATGACAAAAGCAAGCCTTCACAG 


ACGTTGGATGGTACTGTAAAAGGTACTGTG 


RS2038728 


ACGTTGGATGAAGGCTTTTGGACACAAGTC 


ACGTTGGATGGCACCTCTTATGATGTTCCC 


rs9 12429 


ACGTTGGATGTTCAATTCCCCAAAGCCCTC 


ACGTTGGATGGGCAAGTTCCATAACCTCTC 


RS3742269 


ACGTTGGATGGAGAAAAGAGAACGAGAAGG 


ACGTTGGATGTAAATGACAGCAGTCTGGAG 


RS3742270 


ACGTTGGATGCTAAAACCAAAGCTGAGGGG 


ACGTTGGATGTTGTGCTCCTGTGGGATAGG 


RS3803192 


ACGTTGGATGTCCTTTTGCTTGTGCGATGC 


ACGTTGGATGTGCTTCCCCATCAGTTCTTG 


Rs3803191 


ACGTTGGATGCTGTCTGTACATTACCAGGC 


ACGTTGGATGAATAGCAGCTGGAGGATCTC 


rs754106 


ACGTTGGATGTTCTTACCATCCAGCAAGGC 


ACGTTGGATGGCCTGGTAATGTACAGACAG 


RS2005053 


ACGTTGGATGCTGTTGCTAGCTTGGATTTG 


ACGTTGGATGTTCCCTGTCCTTTCTGGCAT 


Rs1 535793 


ACGTTGGATGAACAAAGAGGAACAGAGCCC 


ACGTTGGATGGCATAAGCCCCTTnrCCTAG 


Rs1 886220 


ACGTTGGATGTCACCGTGTTAGCGAGAATG 


ACGTTGGATGTAATCCCAGCAGTTTGGGAG 


RS1886219 


ACGTTGGATGTGTAACTGGATTTGCTGGAG 


ACGTTGGATGTACATCAATAGCCGAGGAAG 


Rs1 535792 


ACGTTGGATGCTGTATATCAGTGACTGTCC 


ACGTTGGATGCAGAGAAGAAAGATCTCAGC 


Rs1 535791 


ACGTTGGATGGAGGGTTTATCCTTACAATrG 


ACGTTGGATGTTTTAGGGTCCCTTGATAAG 


rs912428 


ACGTTGGATGACTACATCCATTCCAGGGAG 


AGGTTGGATGTGAGATCAGAGTGAGTTTAG 


RS1886218 


ACGTTGGATGTCCCGAAAAGAAGTCAAGAC 


AGGTTGGATGAGTCCAGGCAAAACAGTAAG 


Rs1 570622 


ACGTTGGATGATAGGTGGCACACTCTTTAG 


ACGTTGGATGGCGCAGTTTAGAAAAACCTG 


rs912427 


ACGTTGGATGTAGGGTTCTCGATGGGTATG 


ACGTTGGATGTTTGCGGTGGTCACTTTAGG 


rs912426 


ACGTTGGATGTTAGAGGATGCATAGGCCAG 


ACGTTGGATGAAGTGAGTTAGTGCATGGTC 


RS3068693 


ACGTTGGATGAAATTGGCCAGATGGAATCC 


AGGTTGGATGGTACGTTTAAGATCCCTGTC 


Rs1 570621 


ACGTTGGATGAATTAAGAATGGCAGCTATG 


ACGTTGGATGGTTTAAAACTAAAAAGAC 


Rs1 886965 


ACGTTGGATGCTGCTAAGGATATGTGTTTCC 


ACGTTGGATGACACCAGTGCTCAGTATTTG 


Rs1 008849 


ACGTTGGATGGCAGTTGTGAATTGTGCAGC 


ACGTTGGATGTGGTGCAGAACATGTCAGAG 


rs912434 


ACGTTGGATGTTCTGACATGTACAGACGTG 


ACGTTGGATGTCCTGGGAAATCTTTCCATC 


RS3889095 


ACGTTGGATGAAGGTAATGATATGTCCCCC 


ACGTTGGATGCGCATTTTACAGAGACATTG 


rs716223 


ACGTTGGATGACACTGTCTCTAGAAGCAGG 


ACGTTGGATGGAAGCAGGAAAAGAGTGAGG 


RS2897207 


ACGTTGGATGTCAGCCTCCAGAACTATGAG 


ACGTTGGATGAACAGAGAGAGAGCCTGTCT 


Rsl 570620 


ACGTTGGATGCTGTTCCTGCCTTGATATGG 


ACGTTGGATGGAAGGAAGTCTATTCAGCGC 


Rsl 467605 


ACGTTGGATGATGTTACAGGGTGGTAAGCG 


ACGTTGGATGTAAAGTTGCCACGCTTCTGC 


Rsl 467604 


ACGTTGGATGATATACGGCATGTTACAGGG 


ACGTTGGATGTTAAAGTTGCCAGGCTTCTG 


Rsl 408224 


ACGTTGGATGACTTCCCACTCCTCTAGACA 


ACGTTGGATGTATTGGCTGGGTAGCACTCC 


Rs1 408223 


ACGTTGGATGTCATTACCAGTTCCACAGAG 


ACGTTGGATGTTGAGACATCATGAGGAGTG 
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CTGTGTCCCCCAAGTCTTTG 
TTGGCATTTCTTTGAGAA 
AGGTGGTTATGGGTTTGTCACTCA 
TCCAAAAAGCCCAAGAAATTCT 
CCTTAACACGTTTATAATAGATTA 

GTGCCTTCCATCCTTAfl^CAC 
GGCACAAAACCCAACTA I I I I I C 
GCACAAAACCCAACTATTTTTCC 
CCTCAGACTGGGTGGCTTA 
CACCCACCTGAGGGAGGC 



GATACCTTGAAGAATTTTTAAAAC 



TTTAGGCACATTTGTACTTATAAA 



TGGACACAAGTCCATGCAACA 



CTGTGACAGGTGCTATTATCA 



TTTTGGACCGATTTCCGGTG 



GCTGACGGGGATTCCCTTTA 



GATGCACTAAAAGCAGCAATGT 



Rs3803191 



TCCAGCCTTCATATTTTCCTC 



ATCCAGCAAGGCACTTAGAAT 



TGTGGCCrrCAGATGCTTACAT 



GAGGAACAGAGCCCAAAGGACA 



CTGACGTCGTGATCCGCC 



ACTGGATTTGCTGGAGTTAAGAA 



Rs 1535792 



TATCAGTGACTGTCCTTTTCTTTT 



TTATCCTTACAATTGAAGAAAGGA 



CCATTCCAGGGAGACTCCCA 



GAAAACAAGTCAAGACATTTATTG 



CTGCCACACTCTTTAGATGAAGTT 



GGGAGATGACAGAACAAAACT 



AGGTGCCAAGTGTTAGAAGAAAC 



GCGTCACATTG I I I I I I I I I I I 



TCGGTCATAAGTTTAATGAAGG 



TGATTTTATGACTCACATTATTTC 
GTGAATTGTGCAGCTATAAACATG 
AGACGTGCCCAGCTATGATA 
TCCCCCATAACATTTCAGCAT 
GTGGTTTGTATTTCCAGTGTCA 
AACTATGAGAAATAAATGTGTGGG 
TTGATATGGTTCTTGGTTGTTGG 
GTAAGCGGTAGAAAGAAAAATAA 
ACGGCATGTTACAGGGTGGTAAG 
GGGCACACATTCAGAACTGCCC 
ACAGAGGAAGACCAAATGACA 
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Genetic Analysis 

[0247] Allelotyping.results fix)m the discovery cohort are shown for cases and controls in Table 25. 
The allele frequency for the A2 allele is noted in the fifth and sixth colxmms for osteoarthritis case pools 
and control pools, respectively, where "AF" is allele frequency. The allele frequency for the Al allele 
can be easily calculated by subtracting the A2 allele frequency from 1 (Al AF = 1-A2 AF). For example, 
the SNP rsl570976 has the following case and control allele frequencies: case Al (C) = 0.49; case A2 
(T) = 0.51; control Al (C) = 0.53; and control A2 (T) = 0.47, where the nucleotide is provided in 
paranthesis. Some SNPs are labeled "untyped" because of failed assays. 



TABLE 25 



dbSNP 
rs# 


SEQ ID NO: 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs1012628 


243 


44917643 


C/T 


070 


oTo 


— 0 768 — 


rs 1570976 


10208 


44927608 


C/T 


— 0^1 — 


0^47 


— 0 125 — 
= 


rs9 12436 


15049 


44932449 


C/T 


0^98 


— untpyed — 




rs912435 


15111 


4493251 1 


A/G 


0.64 


0.36 


~0.0001 


rs912433 


15272 


44932672 


CAT 


0.22 






rs9 12432 


15287 


44932687 


A/G 


0.46 


0.44 


0.282 


rs9 12431 


15326 


44932726 


A/G 


0.46 


0.46 


0.969 


rs9 12430 


15327 


44932727 


C/T 


0.20 


0.19 


0.584 


rs 1408225 


17038 


44934438 


C/T 








rs998657 


19391 


44936791 


A/G 


0.47 


0.44 


0.254 


rs 1324006 


21702 


44939102 


C/T 


0.55 


0.53 


0.419 


rs 19244 17 


22431 


44939831 


C/G 


0.53 


0.49 


0.108 


rs2038728 


22881 


44940281 


A/G 


0.34 


0.38 


0.082 


rs9 12429 


27744 


44945144 


ATT 








rs3742269 


32564 


44949964 


A/G 


0.83 


0.83 


0.967 


rs3742270 


32698 


44950098 


A/C 


0.53 


0.50 


0.170 


rs3803192 


33104 


44950504 


G/T 








rs3803191 


33181 


44950581 


C/T 








rs754106 


33256 


44950656 


C/T 


0.40 


0.41 


0.714 


rs2005053 


33543 


44950943 


C/T 


0.40 


0.40 


0.877 


rsl 535793 


35567 


44952967 


C/T 


0.26 


0.26 


0.910 


rs 1886220 


40085 


44957485 


C/T 








rsl 8862 19 


40482 


44957882 


Ml 


0.21 


0.22 


0.867 


rsl 535792 


45641 


44963041 


flJT 


0.73 


0.71 


0.550 


rs1 535791 


46059 


44963459 


MG 


0.08 


0.15 


0.009 


rs912428 


48504 


44965904 


C/T 








rs1886218 


48919 


44966319 


A/C 








rs1 570622 


49693 


44967093 


C/T 


0.73 


0.75 


0.451 


rs912427 


49874 


44967274 


A/G 


0.68 


0.70 


0.352 


rs912426 


50020 


44967420 


A/G 


0.76 


0.77 


0.680 


rs3068693 


50616 


44968016 


-nrr 


0.22 


0.21 


0.597 


rsl 570621 


50719 


44968119 


A/G 


0.19 


0.18 


0.569 


rsl 886965 


55511 


4497291 1 


C/T 








rs1 008849 


65533 


44982933 


A/G 


0.48 


0.43 


0.160 


rsgi2434 


70529 


44987929 


A/C 


0.23 


0.23 


0.988 


rs388g095 


75591 


44992991 


C/T 


0.90 


0.90 


0.880 


rs716223 


77266 


44994666 


G/T 


0.91 


0.90 


0.981 


rs2897207 


80368 


44997768 


G/T 


0.46 


0.46 


0.921 


rs1 570620 


82475 


44999875 


A/G 


0.67 


0.68 


0.738 


rs1 467605 


92462 


45009862 


G/T 


0.29 


0.22 


0.044 
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dbSNP 


Position in 
SEQ ID NO: 

3 


Chromosome 
Position 


A1/A2 
AUele 


F A2 
Case AF 


F A2 
Control AT 


Fp- 
Value 


rs1 467604 


92480 


45009880 


CfT 


0.68 


0.67 


0.537 


rs1 408224 


95819 


45013219 


err 


0.66 


0.65 


0.683 


rs1 408223 


96275 


45013675 


CAT 


0.29 


0.28 


0.587 



[0248] The CHDCl proximal SNPs were also allelotyped in the replication cohorts using the 
methods described herein and the primers provided in Tables 23 and 24. The replication allelotyping 
results for replication cohort #1 and replication cohort #2 are provided in Tables 26 and 27, respectively. 



TABLE 26 



dbSNP 
rs# 


Position in 
SEQ ID NO: 
3 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F p- 


rs1012628 


243 


44917643 


CfT 


0.69 


0.72 


0.337 


rs 1570976 


10208 


44927608 


CfT 


0.48 


0.46 


0.490 


rs9 12436 


15049 


44932449 


CfT 








rs9 12435 


15111 


4493251 1 


A/G 


0.16 


untyped 


0.637 


rs9 12433 


15272 


44932672 


CfT 


0.28 


0.28 


0.984 


rs9 12432 


15287 


44932687 


A/G 


0.46 


0.42 


0.260 


rs9 12431 


15326 


44932726 


A/G 


0.46 


0.48 


0.602 


rs9 12430 


15327 


44932727 


CfT 


0.18 


0.20 


0.476 


rs1 408225 


17038 


44934438 


CfT 








rs998657 


19391 


44936791 


A/G 


0.46 


0.43 


0.380 


rs 1324006 


21702 


449391 02 


CfT 


0.54 


0.53 


0.811 


rs1924417 


22431 


44939831 


C/G 


0.51 


0.49 


0.440 


rs2038728 


22881 


44940281 


A/G 


0.35 


0.39 


0.181 


rs9 12429 


27744 


44945144 


AfT 








rs3742269 




44949964 






0.85 


0.911 


ISO 1 '^i.z. t U 


32698 




— — 


056 






rs3803192 


33104 


44950504 


GfT 








rs3803191 


33181 


44950581 


CfT 








rs754106 


33256 


44950656 


CfT 


0.40 


0.40 


0.827 


rs2005053 


33543 


44950943 


CfT 


0.40 


0.37 


0.328 


rs1 535793 


35567 


44952967 


CfT 


0.27 


0.24 


0.259 


rsl 886220 


40085 


44957485 


CfT 








rs1886219 


40482 


44957882 


AfT 


0.22 


0.19 


0.302 


rs1 535792 


45641 


44963041 


AfT 


0.73 


0.76 


0.435 


rsl 535791 


46059 


44963459 


A/G 


0.08 


0.08 


0.958 


rs912428 


48504 


44965904 


CfT 








rs1886218 


48919 


44966319 


A/C 








rs1 570622 


49693 


44967093 


CfT 


0.71 


0.79 


0.007 


rs912427 


49874 


44967274 


A/G 


0.65 


0.73 


0.007 


rs9 12426 


50020 


44967420 


A/G 


0.74 


0.80 


0.047 


rs3068693 


50616 


44968016 


-/TTT 


0.25 


0.21 


0.236 


rs1 570621 


50719 


44968119 


A/G 


0.22 


0.15 


0.028 


rsl 886965 


55511 


44972911 


CfT 








rsl 008849 


65533 


44982933 


A/G 


0.47 


untyped 


NA 


rs912434 


70529 


44987929 


A/C 


0.24 


0.19 


0.083 


rs3889095 


75591 


44992991 


CfT 


0.91 


0.91 


0.867 


rs716223 


77266 


44994666 


GfT 


0.91 


0.93 


0.598 


rs2897207 


80368 


44997768 


GfT 


0.48 


0.45 


0.321 


rsl 570620 


82475 


44999875 


A/G 


0.66 


0.72 


0.034 


rsl 467605 


92462 


45009862 


GfT 


0.29 


0.22 


0.044 


rs1 467604 


92480 


45009880 


err 


0.66 


0.70 


0.307 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
3 


Chromosome 
Position 


A1/A2 
AUele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs1 408224 


95819 


45013219 




0.64 


0.67 


0.312 


rs1 408223 


96275 


45013675 


cn- 


0.31 


0.23 


0.028 


TABLE 27 


rs 


Position in 
SEQ ID NO: 
3 


romosome 
osition 


A1/A2 

All 

Allele 


F A2 
Case AF 


Control AF 


* p- 
Value 


rs1012628 


243 


44917643 


CfT 


0.71 


0.68 


0.438 


rsl 570976 


10208 


44927608 


CfT 


0.55 


0.50 


0.159 


rs912436 


15049 


44932449 


CfT 








rs912435 


15111 


4493251 1 


A/G 


0.66 


untyped 




rs912433 


15272 


44932672 


CfT 


0.14 


0.17 


0.479 


rs912432 


15287 


44932687 


A/G 


0.47 


0.46 


0.806 


rs9 12431 


15326 


44932726 


A/G 


0.46 


0.44 


0.513 


rs912430 


15327 


44932727 


err 


0.23 


0.17 


0.084 


rs1 408225 


17038 


44934438 


cn 








rs998657 


19391 


44936791 


A/G 


0.48 


0.45 


0.518 


rs 1324006 


21702 


44939102 


cn 


0.55 


0.52 


0.324 


rsl 92441 7 


22431 


44939831 


C/G 


0.54 


0.49 


0.123 


rs2038728 


22881 


44940281 


A/G 


0.34 


0.37 


0.295 


rs912429 


27744 


44945144 


AH- 








rs3742269 


32564 


44949964 


A/G 


0.82 


0.82 


0.861 


rs3742270 


32698 


44950098 


A/C 


0.50 


0.49 


0.873 


rs3803192 


33104 


44950504 


GfT 








rs3803191 


33181 


44950581 


CfT 








rs754106 


33256 


44950656 


CfT 


0.41 


0.44 


0.346 


rs2005053 


33543 


44950943 


err 


0.40 


0.44 


0.302 


rs1 535793 


35567 


44952967 


CfT 


0.25 


0.31 


0.096 


rs1 886220 


40085 


44957485 


cn 








rs1886219 


40482 


44957882 


fiJT 


0.20 


0.27 


0.053 


rs1 535792 


45641 


44963041 


fiJT 


0.73 


0.63 


0.007 


rsl 535791 


46059 


44963459 


A/G 


NA 


0.27 


NA 


rs912428 


48504 


44965904 


cn 








rs1886218 


48919 


44966319 


A/C 








rsl 570622 


49693 


44967093 


err 


0.75 


0.67 


0.040 


rs912427 


49874 


44967274 


A/G 


0.71 


0.64 


0.059 


rs912426 


50020 


44967420 


A/G 


0.78 


0.72 


0.065 


rs3068693 


50616 


44968016 


-/TTT 


0.19 


0.21 


0.520 


rsl 570621 


50719 


44968119 


A/G 


0.15 


0.21 


0.077 


rsl 886965 


55511 


4497291 1 


cn 








rsl 008849 


65533 


44982933 


A/G 


0.49 


0.43 


0.138 


rs9 12434 


70529 


44987929 


A/C 


0.21 


0.28 


0.027 


rs3889095 


75591 


44992991 


cn 


0.89 


0.88 


0.583 


rs716223 


77266 


44994666 


G/T 


0.90 


0.87 


0.368 


rs2897207 


80368 


44997768 


G/T 


0.44 


0.48 


0.276 


rsl 570620 


82475 


44999875 


A/G 


0.70 


0.62 


0.026 


rsl 467605 


92462 


45009862 


G/T 








rsl 467604 


92480 


45009880 


cn 


0.71 


0.62 


0.018 


rsl 408224 


95819 


45013219 


cn 


0.68 


0.61 


0.060 


rsl 408223 


96275 


45013675 


cn 


0.27 


0.34 


0.023 



[0249] Allelotyping results were considered particularly significant with a calculated p-value of less 
than or equal to 0.05 for allelotype results. These values are indicated in bold. The allelotyping p-values 
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were plotted in Figure IC for the discovery cohort. The position of each SNP on the chromosome is 
presented on the x-axis. The y-axis gives the negative logarithm (base 10) of the p-value comparing the 
estimated allele in the case group to that of the control group. The minor allele frequency of the control 
group for each SNP designated by an X or other symbol on the graphs in Figure IC can be determined by 
consulting Table 25. For example, the left-most X on the left graph is at position 4491 7643. By 
proceeding down the Table from top to bottom and across the graphs from left to right the allele 
frequency associated with each symbol shown can be determined. 

[0250] To aid the interpretation, multiple lines have been added to the graph. The broken horizontal 
lines are drawn at two common significance levels, 0.05 and 0.01 . The vertical broken lines are drawn 
every 20kb to assist in the interpretation of distances between SNPs. Two other lines are drawn to 
expose linear trends in the association of SNPs to the disease. The light gray line (or generally bottom- 
most curve) is a nonlinear smoother through the data points on the graph using a local polynomial 
regression method (W.S. Cleveland, E. Grosse and W.M. Shyu (1992) Local regression models. Chapter 
8 of Statistical Models in S eds J.M. Chambers and T.J. Hastie, Wadsworth & Brooks/Cole.). The black 
line provides a local test for excess statistical significance to identify regions of association. This was 
created by use of a lOkb sliding window with Ikb step sizes. Within each window, a chi-square 
goodness of fit test was applied to compare the proportion of SNPs that were significant at a test wise 
level of 0.01 , to the proportion that would be expected by chance alone (0.05 for the methods used here). 
Resulting p- values that were less than 10"* were truncated at that value. 

[0251] Finally, the exons and introns of the genes in the covered region are plotted below each 
graph at the appropriate chromosomal positions. The gene boundary is indicated by the broken 
horizontal line. The exon positions are shown as thick, unbroken bars. An arrow is place at the 3' end of 
each gene to show the direction of transcription. 

Example 7 
ERG Region Proximal SNPs 

[0252] It has been discovered that SNP rs 1888475 in v-ets erythroblastosis virus E26 oncogene like 
(ERG) is associated with occurrence of osteoarthritis in subjects. One hundred sixty-six additional allelic 
variants proximal to rsl 888475 were identified and subsequently allelotyped in osteoarthritis case and 
control sample sets as described in Examples 1 and 2. The polymorphic variants are set forth in Table 
28. The chromosome positions provided in column four of Table 28 are based on Genome "Build 34" of 
NCBI's GenBank. 
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TABLE 28 



dbSNP 


Chromosome 


PosUion in SE\ 


Ch romosome 




rs# 




ID NO: 4 


Position 


Variants 


rs2898353 


21 


231 


38783681 


a/t 


rs960818 


21 


882 


38784332 


a/g 


rs960819 


21 


960 


38784410 


a/c 


rs2410034 


21 


1194 


38784644 


a/c 


rs2 836437 


21 


1530 


38784980 


a/g 


rs2 836438 


21 


1673 


38785123 


a/q 


rs2 836439 


21 


2096 


38785546 


c/t 


rs2 836440 


21 


2285 


38785735 


a/g 


rs2226683 


21 


5873 


38789323 


c/t 


rs2 836441 


21 


7256 


38790706 


a/g 


rs2836442 


21 


7988 


38791438 


a/q 


rs2 836443 


21 


8222 


38791672 


R/t 


rs2836444 


21 


8381 


38791831 


c/t 


rs3787906 


21 


8814 


38792264 


c/t 


rs3838108 


21 


8915 


38792365 


-/c 


rs2 836445 


21 


9642 


38793092 


a/g 


rs2 836446 


21 


9902 


38793352 


a/t 


rs3787908 


21 


10619 


38794069 


a/g 


rs2836447 


21 


10927 


38794377 


c/t 


rs2 836448 


21 


11032 


38794482 


c/t 


rs2 836450 


21 


14377 


38797827 


c/t 


rs2836451 


21 


15608 


38799058 


c/t 


rs1015022 


21 


15928 


38799378 


c/g 


rs2836452 


21 


16296 


38799746 


a/g 


rs2836453 


21 


17598 


38801048 


a/t 


rs3787909 


21 


19272 


38802722 


a/q 


rs2 836454 


21 


20084 


38803534 


a/g 


rs2 836455 


21 


20577 


38804027 


a/t 


rs2155718 


21 


28051 


38811501 


a/q 


rs2 836456 


21 


29466 


38812916 


a/q 


rs2836457 


21 


29530 


38812980 


c/t 


rs2 836458 


21 


29987 


38813437 


a/g 


rs2 032323 


21 


30012 


38813462 


c/t 


rs2051400 


21 


30322 


38813772 


fl/t 


rs2836459 


21 


32216 


38815666 


c/t 


rs2836460 






38815966 




rs2836461 


21 


32544 


38815994 


^ 


rs2836462 


21 


32746 


38816196 


a/q 


rs2836463 


21 


33137 


38816587 


g/t 


rs2836464 


21 


33538 


38816988 


a/q 


rs2836465 


21 


33798 


38817248 


c/t 


rs2636466 


21 


33802 


38817252 


a/c 


rs2836467 


21 


33964 


38817414 


c/t 


rs3827204 


21 


34132 


38817582 


a/g 


rs2836468 


21 


34210 


38817660 


c/t 


rs3787911 


21 


34317 


38817767 


a/g 
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dbSNP 




Position in SEQ 


Ctiromosome 


Allele 


rs# 




ID NO: 4 


Position 


Variants 


rs2836469 


21 


34499 


38817949 


C/t 


rs2836470 


21 


34753 


38818203 


a/c 


rs2212599 


21 


34845 


38818295 


c/t 


rs2836472 


21 


35335 


38818785 


c/t 


rs2836473 


21 


36423 


38819873 


c/t 


rs1 888469 


21 


36450 


38819900 


a/q 


rs1 888470 


21 


36481 


38819931 


g/t 


rs2032322 


21 


38447 


38821897 


c/g 


rs2410035 


21 


38784 


38822234 


c/t 


rs 1573332 


21 


39387 


38822837 


a/t 


rs2836474 


21 


39458 


38822908 


c/t 


rs2836475 


21 


39822 


38823272 


C/Q 


rs3787914 


21 


40305 


38823755 


c/q 


rs1 888471 


21 


40869 


38824319 


c/t 


rsl 888472 


21 


40926 


38824376 


C/t 


rs1 888473 


21 


41010 


38824460 


c/t 


rs1 888474 


21 


41134 


38824584 


C/t 


rs2836476 


21 


41984 


38825434 


a/q 


rs3787916 


21 


42172 


38825622 


a/t 


rs2836477 


21 


42753 


38826203 


q/t 


rs970043 


21 


43011 


38826461 


c/t 


rs22 12600 


21 


43176 


38826626 


a/q 


rs2836478 


21 


43320 


38826770 


q/t 


rs2836479 


21 


43381 


38826831 


a/t 


rsl 475877 


21 


44142 


38827592 


a/q 


rs2836480 


21 


44383 


38827833 


a/q 


rs2836481 


21 


44726 


38828176 


c/t 


rs2836483 


21 


45087 


38828537 


a/g 


r52836484 


21 


45141 


38828591 


eft 


rs2836485 


21 


45359 


38828809 


c/g 


rs2836486 


21 


45421 


38828871 


c/t 


rs2836487 


21 


45456 


38828906 


c/t 


rs1893199 


21 


45467 


38828917 


c/t 


rs2836488 


21 


45486 


38828936 


c/t 


rs1 893200 


21 


45709 


38829159 


a/q 


rsl 893201 


21 


45716 


38829166 


a/g 


rs2836489 


21 


47626 


38831076 


c/t 


rs1 888475 


21 


49413 


38832863 


a/g 


rs2836490 


21 


49796 


38833246 


c/t 


rs2836491 


21 


49962 


38833412 


a/q 


rs2836492 


21 


50075 


38833525 


c/t 


rs2836493 


21 


50093 


38833543 


a/q 


rs2836494 


21 


50571 


38834021 


c/t 


rs2836495 


21 


50615 


38834065 


a/q 


rs2898354 


21 


50780 


38834230 


a/q 


rs306539G 


21 


50851 


38834301 


-/ta 


rs2836496 


21 


51459 


38834909 


a/c 
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dbSNP 
rs# 


C hromosome 


Position in SEQ 
ID NO: 4 


Chromosome 
Position 


Allele 
Variants 


rs2836497 


21 


53193 


38836643 


c/t 


rs2836498 


21 


53702 


38837152 


crt 


rs2836499 


21 


53736 


38837186 


a/c 


rs2836500 


21 


53795 


38837245 


c/t 


rs2836501 


21 


54109 


38837559 


a/t 


rs2836502 


21 


54126 


38837576 


c/t 


rs2836503 


21 


54230 


38837680 


a/c 


rs2836504 


21 


54894 


38838344 


c/t 


rs3787917 


21 


55455 


38838905 


a/q 


rs2836505 


21 


55499 


38838949 


a/g 


rs2836606 


21 


56522 


38839972 


c/t 


rs2836507 


21 


56662 


38840112 


c/t 


rs2836508 


21 


56954 


38840404 


a/q 


rs2836509 


21 


57267 


38840717 


a/q 


rs2836510 


21 


58282 


38841732 


a/q 


rs283651 1 


21 


58916 


38842366 


a/c 


rs2212601 


21 


59544 


38842994 


c/fl 


rs22 12602 


21 


59666 


38843116 


c/t 


rs2226682 


21 


59913 


38843363 


a/t 


rs2836512 


21 


66846 


38850296 


a/q 


rs2836513 


21 


67245 


38850695 


q/t 


rs 1999328 


21 


67652 


38851102 


a/c 


rs22 12603 


21 


67955 


38851405 


a/g 


rs3787919 


21 


67966 


38851416 


a/c 


rs2836514 


21 


68420 


38851870 


a/g 


rs1023153 


21 


70226 


38853676 


a/g 


rsl 023372 


21 


70810 


38854260 


c/t 


rs2212604 


21 


72246 


38855696 


a/q 


rs2226684 


21 


73330 


38856780 


g/t 


rs2212605 


21 


73457 


38856907 


c/t 


rs2 187307 


21 


74389 


38857839 


a/g 


rs3065412 


21 


74638 


38858088 


-/aa 


rs2898355 


21 


74640 


38858090 


a/c 


rs2836518 


21 


75358 


38858808 


a/c 


rs3838110 


21 


75952 


38859402 


-/fl 


rs2836519 


21 


76098 


38859548 


a/q 


rs3827207 


21 


77836 


38861286 


a/q 


rs2836520 


21 


78449 


38861899 


a/c 


rs2836521 


21 


78507 


38861957 


q/t 


rs2836522 


21 


80031 


38863481 


q/t 


rs2836523 


21 


81695 


38865145 


c/t 


rs2836524 


21 


82775 


38866225 


a/q 


rs2836525 


21 


82795 


38866245 


a/q 


rs3833350 


21 


84611 


38868061 


-/c 


rs2836526 


21 


84657 


38868107 


c/t 


rs2836527 


21 


84693 


38868143 


a/c 


rs3834676 


21 


85020 


38868470 


-/t 
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dbSNP 
rs# 


romosome 


Position in SEQ 
ID NO: 4 


Chromosome 
Position 


Allele 
Variants 


rs2836528 


21 


85048 


38868498 


c/t 


rs3761364 


21 


85100 


38868550 


c/t 


rs2836529 


21 


85325 


38868775 


a/c 


rs2836530 


21 


85452 


38868902 


c/t 


rs3761366 


21 


85868 


38869318 


a/g 


rs2836531 


21 


85936 


38869386 


a/g 


rs2836532 


21 


85990 


38869440 


a/t 


rs2836533 


21 


86139 


38869589 


c/t 


rs2836534 


21 


86497 


38869947 


c/t 


rs2836535 


21 


87236 


38870686 


a/q 


rs2836536 


21 


87248 


38870698 


C/t 


rs3827208 


21 


87533 


38870983 


c/q 


rs715860 


21 


87912 


38871362 


a/q 


rs717231 


21 


88108 


38871558 


q/t 


rs2836537 


21 


88494 


38871944 


a/c 


rs2836538 


21 


89598 


38873048 


a/c 


rs2836539 


21 


90235 


38873685 


a/t 


rs2836540 


21 


91287 


38874737 


q/t 


rs2836541 


21 


91359 


38874809 


c/t 


rs2836542 


21 


92384 


38875834 


a/c 


rs2836543 


21 


92410 


38875860 


c/t 


rs881837 


21 


92900 


38876350 


c/t 


rs3949052 


21 


94495 


38877945 


a/q 


rs2065307 


21 


94512 


38877962 


a/q 


rs3216105 


21 


97777 


38881227 


-/a 


rs2073427 


21 


98333 


38881783 


c/t 



Assay for Verifying and Allelotvpinp SNPs 

[0253] The methods used to verify and allelotype the 166 proximal SNPs of Table 28 are the same 
methods described in Examples 1 and 2 herein. The primers and probes used in these assays are provided 
in Table 29 and Table 30, respectively. 



TABLE 29 



dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs2898353 


ACGTTGGATGAATGTGAATGTGGAGGTAGC 


ACGI IGGAIGCICCCI IGCIGGI 1 1 1 i 1 IG 


rs960818 


ACGTTGGATGTGGGATTTTTCCCAGAAGAG 


ACGTTGGATGGTGTGCAGAGAAACATGATG 


rs960819 


ACGTTGGATGCTGTCTCCCTTCTCTTTATC 


ACGTTGGATGCATCATGTTTCTCTGCACAG 


rs2410034 


ACGTTGGATGTTTAGAGACATTTCTCCTAG 


ACGTTGGATGTTAGGATGATGTTAGTTTGG 


rs2836437 


ACGTTGGATGAGCTTCTGCGATATCAGTGG 


ACGTTGGATGTTCCTGTCAGCACATTCTCC 


rs2836438 


ACGTTGGATGAACATGTCTTGGCCAAGCTC 


ACGTTGGATGCCACTGTGACCTCTGGATTT 


rs2836439 


ACGTTGGATGCCTAGTGTATAAAGTGATGC 


ACGTTGGATGTCCTTTCTAGGCACCAATAC 


rs2836440 


ACGTTGGATGAGATCCTAACCAACCACAGC 


ACGTTGGATGAGGTAGGTAGATAGAAGGCC 


rs2226683 


ACGTTGGATGAATATGGCTCCTATAGACAG 


ACGTTGGATGTTTTGGGTCACAAAATCAAG 
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dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs2836441 


ACGTTGGATGTTACCTTAATAGTGCTGGCC 


ACGTTGGATGACTTTCTGGTCAGAGAGAAG 


rs2836442 


ACGTTGGATGCAAGGACTCTAGGCTTACAG 


ACGTTGGATGGGGACATTTGTAGTCACTTC 


rs2836443 


ACGTTGGATGGGGCCCCATTACATGTCTAA 


ACGTTGGATGTTCGCTGTACTTGCTTCGAG 


rs2836444 


ACGTTGGATGCTGCAACCAGGAATTGTCAG 


ACGTTGGATGAGGACCCATAAAGAGGTGTG 


rs3787906 


ACGTTGGATGTGAAAAGAGCGGA<\ATCAAC 


ACGTTGGATGGTAAGAAAATCATTCTGTGG 


rs3838108 


ACGTTGGATGATGAATAAGATGGCAGGCTG 


ACGTTGGATGAAGCTGCCCAGATAAAACAG 


rs2836445 


ACGTTGGATGCATTTCCAAAATTAGACGCAG 


ACGTTGGATGAAAAAGAGAAAAACAGATGC 


rs2836446 


ACGTTGGATGGTGCCTTGTCCTATCAAGAG 


ACGTTGGATGAGCATCCAAGCCTGGTAATC 


rs3787908 


ACGTTGGATGAATCACCACACTAGACCAGC 


ACGTTG G ATG C ATG C AAG GG AAATGTGTGC 


rs2836447 


ACGTTGGATGATCTCCTCTCTTTGCTCTGC 


ACGTTGGATGGAGGAAGGTTAGGAGCTAAG 


rs2836448 


ACGTTGGATGTGTAGGGATGTATAGGGCAG 


ACGTTGGATGAAAGAGAGGAGATCCGTCTG 


rs2836450 


ACGTTGGATGTGTGGGCATCAGATGACAAC 


ACGTTGGATGATCCCGTTAAATGCACCGAC 


rs2836451 


ACGTTGGATGCAGACAAACAACTGTCACCC 


ACGTTGGATGGTATTTCCTTTTCTCGCCGC 


rs1Q15022 


ACGTTGGATGTCGAGCCAGCGTCTTTTATC 


ACGTTGGATGGTAACAGTCGTACATTCCGG 


rs2836452 


ACGTTGGATGATCACTGACACAGTCATGAG 


ACGTTGGATGCCAGTAACTTTGCAGGTTTG 


rs2836453 


ACGTTGGATGTGTATTTCCCAAGATGGCCC 


ACGTTGGATGCCTCACTTTCTGATGGAAGC 


rs3787909 


ACGTTGGATGACTTCTCAGTGTTCTGGCTG 


ACGTTGGATGCGTCACTCTCTGTTTCATGG 


rs2836454 


ACGTTGGATGAGGAATGATTCACAACCTCC 


ACGTTGGATGGAATGTTCAAATGTAGGGTGG 


rs2836455 


ACGTTGGATGGGTCTATTGCTGTGACATTT 


ACGTTGGATGCATCCCAATTTTTAAGCAAG 


rs2155718 


ACGTTGGATGAGAACTCTCACACACAGCTG 


ACGTTGGATGTGCCTCTTATTACAGCCCTG 


rs2836456 


ACGTTGGATGGGGATTGTCTGATCTCCTTG 


ACGTTGGATGCCAGCTTTCCTTTGTGCATG 


rs2836457 


ACGTTGGATGAACTCCTGGAATGAGTCACC 


ACGTTGGATGATGCACAAAGGAAAGCTGGG 


rs2836458 


ACGTTGGATGATCACTTAGAAGCCCAGCAG 


ACGTTGGATGTGATGCACACTGAGTGAAGG 


rs2032323 


ACGTTGGATGGTAGCCGCACTTTGAGATGC 


ACGTTGGATGAGGAGAGAGTCGAGGAGGAG 


rs2051400 


ACGTTGGATGACAGACCTCAGACCAAAGTC 


ACGTTGGATGTTTGTCCTAGAGTAACCCCC 


rs2836459 


ACGTTGGATGGCAAGAATGTTACTTTCTGG 


ACGTTGGATGCCATCAAATAGTTGGTTGTC 


rs2836460 


ACGTTGGATGCAATATCTGAGTTTCACCCC 


ACGTTGGATGGTAGATGAGAATTCCGTGTG 


rs2836461 


ACGTTGGATGGTTACCCACACGGAATTCTC 


ACGTTGGATGCCAGATCCAGGTTCTTTCTG 


rs2836462 


ACGTTGGATGTCTCCTCCGTATGTCTCCAT 


ACGTTGGATGATCCCGGAACTCTCTGTTTC 


rs2836463 


ACGTTGGATGGCACTATTTGACTTGAGCTC 


ACGTTGGATGAATTCAAGCCAGAAAGGCTC 


rs2836464 


ACGTTGGATGGTCTTTTTCACCCCAGTAAAG 


ACGTTGGATGATAAGCAAAAGGACCTTTGG 


rs2836465 


ACGTTGGATGTGAGCTCTTGTGTTTTGCCC 


ACGTTGGATGGAGAATTCTCCAGCCTTCTC 


rs2836466 


ACGTTGGATGTGAGCTCTTGTGTTTTGCCC 


ACGTTGGATGGAGAATTCTCCAGCCTTCTC 


rs2836467 


ACGTTGGATGGACTCTGCTCATTTCCTTGG 


ACGTTGGATGAAGAGTAGGGGTAGATGCAG 


rs3827204 


ACGTTGGATGTGAAGATCACACGTGGTGTA 


ACGTTGGATGGGGTGAATGCCAAAAAGAGG 


rs2836468 


ACGTTGGATGTAGAGGCAGGAAAGAGCATG 


ACGTTGGATGTTTTTGGCATTCACCCTCTC 


rs378791 1 


ACGTTGGATGTAACCCTCTTCTGGATTCGG 


ACGTTGGATGTCATGTGCTCTGAGAGCATC 


rs2836469 


ACGTTGGATGATTTCTCTACCTCATCCCCC 


ACGTTGGATGGGTTGAAGTCACGTAACAGC 


rs2836470 


ACGTTGGATGCCACTGTTAATCGTATTGCC 


ACGTTGGATGACGGACTGAAAGCCAAATGG 


rs22 12599 


ACGTTGGATGAGGAGTTATTCTTCCCCAAC 


ACGTTGGATGCAGTGGTCCATTAAGAATCC 


rs2836472 


ACGTTGGATGGAGTATCGTTCTCTATCATG 


ACGTTGGATGTAAAAGAGTCAGAGCAGGAC 


rs2836473 


ACGTTGGATGTCTCAGCCAGAGTTTTGACC 


ACGTTGGATGAATCAACGCCTCCTCTTCAG 


rs 1888469 


ACGTTGGATGACGACCAGGAAGGGTCTGAA 


ACGTTGGATGGAGGATCAGAGGCAGAAAAC 


rsl 888470 


ACGTTGGATGGCGTTGATTGCAGTTTTCTG 


AGGTTGGATGTTCTTTGGCCTCCGTGTAAG 


rs2032322 


ACGTTGGATGTGATACTCTGTTGAGCCTCC 


ACGTTGGATGGGGGAGCAGTGATGAGTTAT 


rs2410035 


ACGTTGGATGAATCACTTGAACCCAGGAGG 


ACGTTGGATGTTTTTGAGACGGAGTTTCGC 


rs1 573332 


ACGTTGGATGGGGTGAACTTTACAGAGAGG 


ACGTTGGATGCTGCCAGACAGTTTTGAGAC 
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rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs2836474 


ACGTTGGATGAATTCTGCACAGGAGAGTCC 


ACGTTGGATGCAGGAAATGAAGATGTCGCC 


rs2836475 


ACGTTGGATGAGTTCTACATGGGAAGCTGC 


AGGTTGGATGATATCTGTGTCTACAGGCCC 


rs3787914 


ACGTTGGATGGGCTGAAGGCTAAAATCACC 


ACGTTGGATGGTCTGAGAAGTAGGAATGGC 


rs1 888471 


ACGTTGGATGACTGAGGCAATTGTGTAGAC 


ACGTTGGATGTTGACTTTGTTTTGAGAGGC 


rs1 888472 


ACGTTGGATGTTGCCTCTCAAAACAAAGTC 


ACGTTGGATGCTATTATTCTGGAAGCAGCC 


rs1 888473 


ACGTTGGATGAGAAAGTTCAGTTGTCAGCC 


ACGTTGGATGTGTTrGCTCCTGTGAGTAAC 


rs1 888474 


ACGTTGGATGTGTTATGTGAGTCCAGGGTG 


ACGTTGGATGTCTTGTTATGTGGGTGGGTG 


rs2836476 


ACGTTGGATGTTACCTGTGACCTCATTTGG 


ACGTTGGATGGAACACACAACATACGGTAC 


rs3787916 


ACGTTGGATGAAGGCATCTCAGTCATTCTC 


ACGTTGGATGTGAGTTTGACACAAAGAAGG 


rs2836477 


ACGTTGGATGTTTAGGTCTCCTGGATGATG 


ACGTTGGATGCCATGATTAGTGCATGAAGG 


rs970043 


ACGTTGGATGTATAACTCCCCTCTCTCCTG 


ACGTTGGATGAGAGCAGACCCTTATCAGAG 


rs2212600 


ACGTTGGATGGAAACAGGTGTTCATTTGGC 


ACGTTGGATGTCTGCATGAACCAGTAAGTC 


rs2836478 


ACGTTGGATGAGCTATTGAGTGTCACTTGC 


ACGTTGGATGCAGAAGCTTCTGACTTCAAC 


rs2836479 


ACGTTGGATGAGTAGCCATCCTAATAGGTG 


ACGTTGGATGAGCAAGTGACACTCAATAGC 


rs 1475877 


ACGTTGGATGAATCAACACTCCCCGTGTTC 


ACGTTGGATGGGTACCTAGAGTAGTCCAAG 


rs2836480 


ACGTTGGATGTACCAAACCCACTGTACATC 


ACGTTGGATGCATAACCTAACACATTGTGGG 


rs2836481 


ACGTTGGATGTAAGAAGTTCTTTCTCCCCC 


ACGTTGGATGGCTGCTTCITTCATAAGAGG 


rs2836483 


ACGrrGGATGCACTGAGGTAATCTCCAACC 


ACGTTGGATGGGTGGAGATATGGCTTGATG 


rs2836484 


ACGTTGGATGAAGCCCACCAGAGTCATCAA 


ACGTrGGATGACTACTGACCAGCTTTCCAG 


rs2836485 


ACGTTGGATGTTCTAAGTGAAGCCCTCCTC 


ACGTTGGATGTACAGCTGTGCAAACAGTTG 


rs2836486 


ACGTTGGATGCATGGTCTGTTGCCTCTAAG 


ACGTTGGATGCCCTAGCAnTTATGCATCC 


rs2836487 


ACGTTGGATGTGAATACCCACTAGGTCTCG 


ACGTTGGATGCCACCACTAAACTTAGAGGC 


rs1893199 


ACGTTGGATGGGCAACAGACCATGGTTTTG 


ACGTTGGATGCTTCCCTTCAACATGCACTG 


rs2836488 


ACGTTGGATGGGCAACAGACCATGGTTTTG 


ACGTTGGATGCTTCCCTTCAACATGCACTG 


rsl 893200 


ACGTTGGATGAGTTAAGTCTTCGCATAACC 


ACGTTGGATGCCTCTCACACACTAAATCrrG 


rsl 893201 


ACGTTGGATGGTCrrCGCATAACCAAAACAG 


ACGTTGGATGCCTCTCACACACTAAATCTTG 


rs2836489 


ACGTTGGATGGTCAACGATGGAGCTTGAAC 


ACGTTGGATGAGAAGAGATGTGGGCTTGTG 


rs 1888475 


ACGTTGGATGACCCCTGGCAAGTGAATTAC 


ACGTTGGATGGGGAGGTGGATGTTCTTATC 


rs2836490 


ACGTTGGATGAAAGGCAGAGCTAAAGCAAG 


ACGTTGGATGAGCACAACCCAGCAATGCAG 


rs2836491 


ACGTTGGATGACAACTTGGAGTGGAAAGGG 


ACGTTGGATGATCCAGATGGATTCCACAGC 


rs2836492 


ACGTTGGATGACATATGGGCATGGAAGAGC 


ACGTTGGATGAATCCATCTGGATGGAAGAC 


rs2836493 


ACGTTGGATGTTAAGAGTTCCGATGCTTGC 


ACGTTGGATGGTAATCTGGACTTCTCTTCC 


rs2836494 


ACGTTGGATGGTGCATTCATrTGAATTGCTG 


ACGTTGGATGCAGTCTTACTTAAAACTGAC 


rs2836495 


ACGTTGGATGGAATTTAACGAAACTTCAGC 


ACGTTGGATGGGATATTTTCAGGATATCTG 


rs2898354 


ACGTTGGATGTGTAACAAACCTGCACATCC 


ACGTTGGATGGGTACTTTCCAAATATCTGC 


rs3065390 


ACGTTGGATGCGAGACTCCATCTCAAAAAAG 


ACGTTGGATGTGGAAAGTACCAATAGCTrC 


rs2836496 


ACGTTGGATGTGGAGCTTAATGTGTTCCTG 


ACGTTGGATGGTTAGCCATGCATAAGACAG 


rs2836497 


ACGTTGGATGAGCCGGGATGACTGCTAGAC 


ACGTTGGATGAGATGAGGCTGAAGAAGTAA 


rs28364g8 


ACGTTGGATGGGTCCTGGGAAAATAGGATG 


ACGTrGGATGCACCCTTGCTCTTTCTGAAG 


rs2836499 


ACGTTGGATGACTAGTCAGAGCACAGTGAG 


ACGTTGGATGGCTCTCTCCTTCTTTGACTC 


rs2836500 


ACGTTGGATGGCTTCCTGGTTAGTAAGAGG 


ACGTTGGATGATCAACTCAGGGCTCTTCTC 


rs2836501 


ACGTTGGATGACTCACAAAGGTTGAGCTTG 


ACGTTGGATGGAGGTCCAGGTTGAAAGAAC 


rs2836502 


ACGTTGGATGGAGGTCCAGGTTGAAAGAAC 


ACGTTGGATGACTCACAAAGGTTGACCTTG 


rs2836503 


ACGTTGGATGGAGCAATTATCAACCCTACG 


ACGTTGGATGATTCTCCCCCTTCACTCTTG 


rs2836504 


ACGTTGGATGGAGTCTGGGTATGGAAAGAG 


ACGTTGGATGTTCCTAGAAATGGTGTCTGC 


rs3787917 


ACGTTGGATGTTTGGAGGAGGAATGCCTTG 


ACGTTGGATGCGCCCACAAACCTAAGAGAA 


rs2836505 


ACGTTGGATGTTTTCGACTGCTCCACTCTG 


ACGTTGGATGGCTCTCCCTCATTGTTCTTC 
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dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs2836506 


ACGTTGGATGGGCTAAGGGCATCATTTTATC 


ACGTTGGATGGTTTGCTGATTCATGGATGC 


rs2836507 


ACGTTGGATGAGCAAAGGTTCTGGTGTTGG 


ACGTTGGATGAAATGATGCCCTTAGCCCAG 


rs2836508 


ACGTTGGATGGTGTGATGATAI 1 1 I ICTCC 


ACGTTGGATGTTTCAGGTATTCCTCTTTGC 


rs2836509 


ACGTrGGATGTAAAGCTTTCTAAGTCAATG 


ACGTTGGATGTCATATGATAATGGTCTCTG 


rs2836510 


ACGTTGGATGCAGGGAGAGATCTAAACAGC 


ACGTTGGATGGCCAAAGCTATAACACGTGG 


rs2836511 


ACGTTGGATGAGAACCTGACTTTTGGAGTG 


ACGTTGGATGCTTCCTCATTGGTCAGAGTC 


rs22 12601 


ACGTTGGATGCCAGCCTTTAGAACTGTGAG 


ACGTTGGATGTGGGCTGCTGTAACAAAGTG 


rs22 12602 


ACGTTGGATGACTACAACCAGCCAGAGATG 


ACGTTGGATGCACAAACCTTGTGTGAACCC 


rs2226682 


ACGTTGGATGCCAAGATTGAACCAGGAAAG 


ACGTTGGATGCACAAAAGAATTCAGGAGGTG 


rs2836512 


ACGTTGGATGCCCCAAAACTTAGCATCCTG 


ACGTTGGATGTGTTCTGCCTGCACTTCAAC 


rs2836513 


ACGTTGGATGCACTGGGGTTAGCAAGAAAC 


ACGTTGGATGGACTGTGATTCACCCTGTCT 


rs 1999328 


ACGTTGGATGAGTTACAGCGCAAATTGAGG 


ACGTTGGATGGCCTTTATGACTCCATTTCTC 


rs22 12603 


ACGTTGGATGTGGAGGGTGTCTGTGAGTAG 


ACGTTGGATGTCATGGAGCAAGGTCTGTGG 


rs3787919 


ACGTTGGATGCCATCAGCTAGGATTCATGG 


ACGTTGGATGTCTGTGAGTACCCCACAATG 


rs2836514 


ACGTTGGATGCAGGTCTAACTAACTGATGAC 


ACGTTGGATGGCCTCTACTGTTATTTAAGG 


rs1023153 


ACGTTGGATGTACAAAAGTGACCTAGAGCC 


ACGTTGGATGTTCTTGCAGGACATTGTGGG 


rsl 023372 


ACGTTGGATGCAAATTCCAAAATTCTGGTTG 


ACGTTGGATGCTCAGAAGTAACATGTACTC 


rs22 12604 


ACGTTGGATGCAGACTTGAGCATATACCAC 


ACGTTGGATGACCCATGTGGGAAAATGTTG 


rs2226684 


ACGTTGGATGGGTGTTGGAAAAGGAACATC 


ACGTTGGATGTTAATGATAGTTCCCCTCAG 


rs2212605 


ACGTTGGATGATATGAGTGATTTGCATGGG 


ACGTTGGATGTGCATATAAGCTGTCTGCAC 


rs21 87307 


ACGTTGGATGGACATCCTGCAGCTTTAACC 


ACGTTGGATGCCTGGCAGTTTCAAGTAACG 


rs3065412 


ACGTTGGATGGGCTGAGATAGAATGTGCTC 


ACGTTGGATGTCTCGTGCTTTGTTCTGGAG 


rs2898355 


ACGTTGGATGGGCTGAGATAGAATGTGCTC 


ACGTTGGATGTCTCCTGCTTTGTTCTGGAG 


rs2836518 


ACGTTGGATGCACTTGTTGCTTCTTCCACC 


ACGTTGGATGATGCCAACCTTGCTGATGTC 


rs3838110 


ACGTTGGATGGAAGTAGTGAAGTGTTGCGG 


ACGTTGGATGAGCCTCACTGAATCTTAACG 


rs2836519 


ACGTTGGATGTGTTTCTCCTTCTCACTGGG 


ACGTTGGATGAAAGGCTACAGGAACTGAGC 


rs3827207 


ACGTTGGATGTGTAGTCTGCAGCTTCACCT 


ACGTTGGATGAGCGGGTGCTGAACATAGAT 


rs2836520 


ACGTTGGATGCCTGCAAAGGTGTTTGCTTC 


ACGTTGGATGGCCACCTAAI 1 1 1 ICGTCTC 


rs2836521 


ACGTTGGATGAAGAATAAGAAGCAAACACC 


ACGTTGGATGGTTTTAGGGGAAAGGCATAAG 


rs2836522 


ACGTTGGATGTGCATCTTTGGTTGTGACAG 


ACGTTGGATGGCACATCTACTCTTAGCATG 


rs2836523 


ACGTTGGATGTCTCTCrrTCTTTTCCCTAC 


ACGTTGGATGACTCTCAGTTATGATTTCTC 


rs2836524 


ACGTTGGATGGTGTGTTGGTAGAAACGTTC 


ACGTTGGATGGTCACCCCTTCAGATAATAAG 


rs2836525 


ACGTTGGATGCAGAGCCGAAAACATAGTTC 


ACGTTGGATGGTGTGTTGGTAGAAACGTTC 


rs3833350 


ACGTTGGATGGTTGTTCCTTTTGTCTTCTAG 


ACGTTGGATGGAATCATGTCCTTCAGTAAGC 


rs2836526 


AGGTTGGATGATTGTGTCCTGTCCTGCTAG 


ACGTTGGATGGACGGCTAGAAGACAAAAGG 


rs2836527 


ACGTTGGATGGTGTTTTATGTTCTAGCAGG 


ACGTTGGATGGATGCCTTTAGGCAAACATG 


rs3834676 


ACGTTGGATGAAGCTGAAAAGGATGTGCAG 


ACGTTGGATGACAGGGCATACTTCTCTATC 


rs2836528 


ACGTTGGATGCGAAAACTCATGCGATCTGC 


ACGTTGGATGTGGCGCTGAAGTACTCAATG 


rs3761364 


ACGTTGGATGAAACAGCACAGCTACCATTC 


ACGTTGGATGATGAGAAAATGTGTGTGGAG 


rs2836529 


ACGTTGGATGAGCGGTGTTTTAAAATGTCG 


ACGTTGGATGCAGAGCCCAAAAAAAATTTGG 


rs2836530 


ACGTTGGATGACAGACAGTGGTCAGAAGAT 


ACGTTGGATGAAAGATGCCTATAATCCAGG 


rs3761366 


ACGTTGGATGCAGGTGATAAAAAGCAAGTG 


ACGTTGGATGGCCATCAGTTC 1 1 1 1 1 1 GGC 


rs2836531 


ACGTTGGATGGCCTTCGAAAATGTCTCAAG 


ACGTTGGATGCACTTGC 1 1 1 1 lATCACCTG 


rs2836532 


ACGTTGGATGGAAAGACAGCCTTCGAAAATG 


ACGTTGGATGCAATGGCTCTTTGCAGTAAC 


rs2836533 


ACGTTGGATGTTTCTGACCTCTCACGGTAC 


ACGTTGGATGTGCAGATCTGGAGGTAGATG 


rs2836534 


ACGTTGGATGAGAAGAGGCTGGGAGAGGAT 


ACGTTGGATGTGCTGCTCTTAGGATAAGGG 


rs2836535 


ACGTTGGATGACAGGAGGAGTTGAGTGTTG 


ACGTTGGATGTAGAGGCACGGAGAAGATAG 



103 



PATENT 
Docket 524593008700 



dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs2836536 


ACGTTGGATGAAAAGCATGGGTACAGGAGG 


ACGTTGGATGTAGAGGCACGGAGAAGATAG 


rs3827208 


ACGTTGGATGGAGGATGAGAGGTACCTGAG 


ACGTTGGATGGGGATGATCAAACGTAGT 


rs715860 


ACGTTGGATGTTCTGGTGGAGGTTTCTTGG 


ACGTTGGATGCGAGACATGATCTCAAACCC 


rs717231 


ACGTTGGATGCAAGAGACTCAAACAGTTGC 


ACGTTGGATGTCATAGAAGTTACAGCAGCC 


rs2836537 


ACGTTGGATGTTGGTGTGTGATCACTCTGG 


ACGTTGGATGGAACCTAAGTTTCTCCCAGC 


rs2836538 


ACGTTGGATGGGTTAGAGCTTACGTAATTC 


ACGTTGGATGCTACTTGTGTCACTTCTTTG 


rs2836539 


ACGTTGGATGTTATCCTCCAAGAGCCTTAG 


ACGTTGGATGGGGCAAATG GAGTTCTTATT 


rs2836540 


ACGTTGGATGCCCAGTTGGTATCAGTGTTG 


ACGTTGGATGTGCTGAACATCGTTTGGAGG 


rs2836541 


ACGTTGGATGCTTGCAGTGACACCTTTGTG 


ACGTTGGATGGTACTGGCGAAGAGATGATG 


rs2836542 


ACGTTGGATGAGATGAGCCATTTCCTACTG 


ACGTTGGATGCAGCATGAGAAACTGAATGC 


rs2836543 


ACGTTGGATGAAATGGACTTCTTCAGTAGG 


ACGTTGGATGGATACAATTCAACCCATAGC 


rs881837 


ACGTTGGATGAATGGATGTGGCTCTTGAGG 


ACGTTGGATGTATGGAGGGACTTACGAAAG 


rs3949052 


ACGrrGGATGTTTTCAACG GAAACAGATGC 


ACGTTGGATGCCAAGTAAAATATTCAATCCCC 


rs2065307 


ACGTTGGATGTTTTCAACG GAAACAGATGC 


ACGTTGGATGCCAAGTAAAATATTCAATCCCC 


rs3216105 


ACGTTGGATGACCACCATGCCTGGCTAATT 


ACGTTGGATGGGCCTGGACAAAATAGTGAG 


rs2073427 


ACGTTGGATGTTTTGCTTGGGTGTTCTGCC 


ACGTTGGATGGGATTTACACTGGTGTTGGG 



TCCTGTCTTCAGTGCTTGATTCTG 



AGTAGATAACATAAAGTAACCAGC 



GCTATTCACCCTAGCTGTACATAG 



AAATGTAGCTGTAGTATCTTGAA 



TTCACACTCAACAACAAACACA 



TGGAAAGTAAGCTAGACCAAACAG 



GTATAAAGTGATGCTGCTTGC 



AACAATTGGGATATGTCTCTCCAC 



RS2226683 



GAGAGTTAATGTGCCCTACTT 



TAATAGTGCTGGGCATAATGC 



CTCTAGGCTTACAGTAAACAC 



TATAAGTTCAGGGTCACAGGTC 



TGTGTTCTTGGGGTCGCCT 



TAATGTAGGTGCTGAGAACTTAG 



GGCTGATTAAAATTCTGTTTCCCC 



RS2836445 



AGACGCAGTAAAACTTATGGAT 



GCCTTGTCCTATCAAGAGCCAAAG 



CATACAGTAGCTGTGGACAGC 



ATGTATTACATTGAGAACCATGTG 



TGTATAGGGCAGGGATAAAGAC 



AACAACAAATTTACTGATATCATC 



CTGTCACCCATTGACCTCAC 



CTTTTATCTGCAGTTGCACCC 



CGGGAAGATGGCTGCCTTC 
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dbSNP 


Extend 


Term 


rs# 


rmer 


IX 


Rs2836453 


CCAAGATGGCCCAGTAGGA 


CGT 


Rs3787909 


AAATAGTAAAATAAAAAGAGCTCC 


ACG 


Rs2836454 


CACAACCTCCCAAATGAATAAATC 


ACT 


Rs2836455 


TGCTGTGACATTTTAGTGCTTCTG 


CGT 


RS2155718 


CTCACACACAGCTGGAGTTTA 


ACT 


Rs2836456 


CGTTCTGAAGGTTTTGTGTACA 


ACT 


RS2836457 


GAGTCACCCGTCCCCTAGA 


ACT 


RS2836458 


ACAGAAGAGCCAGCCGACA 


ACT 


RS2032323 


TGCACAGTCACTGAAGGCC 


ACT 


RS2051400 


AAACACTATGTGACGCCACC 


ACT 


RS2836459 


AGAATGTTACTTTCTGGATTCTAC 


ACT 


RS2836460 


ATTGTAATTCTGCGTAAAACCC 


ACG 


RS2836461 


TACCCACAGGGAATTCTCATCTAC 


ACT 


RS2836462 


TCCGTATGTCTCCATCCATCTCA 


ACT 


RS2836463 


AAACTTAAATTGCTTTAATCAGGT 


ACT 


Rs2836464 


AATATCTTATCACTGCTCCTGTCT 


ACG 


RS2836465 


GCGCACrnriSTGTTTGCTTTAG 


ACT 


RS2836466 


TTTGGCGACTTTTGTGTTTGCT 


ACT 


RS2836467 


TTAATTTTCTTGTCTCTTTCTGTA 


ACT 


RS3827204 


CCCTCACATCTTCCCCGC 


ACT 


RS2836468 


GCAGGAAAGAGCATGGGCATTAAC 


ACT 


RS378791 1 


TACATCCAAAAGCCTGCCAG 


ACT 


RS2836469 


TCCTGCGAGATCCTGCTCA 


ACG 


RS2836470 


ACAAGCTTAATGTTTTGTTCAGA 


ACT 


Rs2212599 


TTCCCCAACAATAGTCAGAAAA 


ACT 


Rs2836472 


TTCTCTATCATGATGGAGTCC 


ACT 


Rs2836473 


GATGATGAACAGGGCTGTGA 


ACG 


Rs1 888469 


AAGGGTCTGAAGAGGAGGC 


ACT 


Rs1 888470 


GTTTTCTGGCTCTGATCCTCA 


ACT 


RS2032322 


CCTATAGGTAACGTGGCTTCT 


ACT 


Rs2410035 


AGGCAGAAGTTGCAGTGAAC 


ACG 


Rs1 573332 


GAGAGGCCAGAAAGCCTTC 


CGT 


Rs2836474 


GCACAGGAGAGTCCTCAATT 


ACG 


RS2836475 


CATGGGAAGCTGCTGAACTA 


ACT 


RS3787914 


ACAGTGTTTGAGCCCTCCTT 


ACT 


Rs1 888471 


AACTGACAGAAGAAAGAAAAATAT 


ACG 


Rs1 888472 


TGTGTTGGTGTATAAATCAAGATT 


ACG 


Rsl 888473 


CAGTTCTCAGCCAGACGATC 


ACG 


Rs1 888474 


GAGTCCAGGGTGCTAATTTC 


ACG 


RS2836476 


GGTGTTAGCCCTGGGTTCTAATAA 


ACG 


RS3787916 


TCTCTTATGTAAATACAAAGACG 


CGT 


RS2836477 


CCTCTTAAAATAGCCTGCCTTCA 


ACT 


rs970043 


GCTCCTTGACTCAAGTATTTC 


ACG 


RS2212600 


AAAACAACTTTCTCTCCCAAAC 


ACG 
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dbSNP 


Extend 


Term 








RS2836478 


CTTGCTTATCTTCAAGCAGTC 


CGT 


Rs2836479 


CCTAATAGGTGTGAAGTGTAAAA 


CGT 


Rs1 475877 


CTCCCCGTGTTCTGCATG C 


ACG 


Rs2836480 


CCCACTGTACATCTTACACTC 


ACT 


RS2836481 


TCCCCCTGAAATCCCATAGC 


ACT 


Rs2836483 


AGGTAATCTCCAACCAAACCT 


ACT 


RS2836484 


AGTCATCAAGCCATATCTCCA 


ACG 


RS2836485 


CTCCTCTGGGACGTCAGC 


ACT 


RS2836486 


CCTCTAAGTTTAGTGGTGGAT 


ACT 


RS2836487 


TGTTGGGTTCTACACATTCAAA 


ACT 


Rs1893ig9 


CAGACCATGGTTTTGAATGTG 


ACG 


RS2836488 


GTAGAACCCAACACAGAGCC 


ACG 


Rs 1893200 


AGTCTTCGCATAACCAAAACAGA 


ACT 


Rs1 893201 


CGCATAACCAAAACAGAAAAGAAC 


ACT 


Rs2836489 


CAAGAGCTCTTTTCAATTCCAG 


ACT 


Rs1 888475 


GACATCAAATGATTCCCCTGT 


ACT 


RS2836490 


GAGCCAAAGCTTTCCTGATG 


ACT 


RS2836491 


GTGGAAAGGGCACTGTGGT 


ACT 


RS2836492 


GGCATGGAAGAGCAAGCATC 


ACT 


Rs28364g3 


TCCGATGCTTGCTCTTCCAT 


ACT 


Rs28364g4 


TGAAGTTTCGTTAAATTCACTACA 


ACT 


RS2836495 


CTTCAGCAATTCAAATGAATGCAC 


ACT 


RS2898354 


TCCGGCACATATATCCTGGAAC 


ACT 


RS3065390 


AAACAAACAAACAAAAACAGTGTA 


ACT 


RS2836496 


GTGTTCCTGATGTTTCTGGAGT 


CGT 


RS2836497 


CTGCTAGACATTGTCAGTCC 


ACT 


RS2836498 


AATAGGATGAGTCAAAGAAGGAG 


ACT 


Rs2836499 


GAGAAGAGCCCTGAGTTGATAAA 


ACT 


RS2836500 


AGAGGATGAGCAATTTCAGGGA 


ACT 


Rs2836501 


CAAAGGTTGACCTTGTTTTCTAT 


CGT 


RS2836502 


AAGAACTTACATTTTATGGCTTC 


ACT 


Rs2836503 


GATTTGGGAGCAAGGGAGC 


ACT 


RS2836504 


AGAGTTAAAGATGACTCTAGGCTC 


ACT 


Rs3787917 


GCAGCCAGAGTGGAGCAGT 


ACG 


RS2836505 


AAGGCATTCCTCCTCCAAATCAC 


ACT 


Rs2836506 


GAAAATCAAATCAGTTTCTACAAG 


ACT 


Rs2836507 


GTGTTGGAATATTGTTGGCCT 


ACT 


RS2836508 


ATTCTCTACCATTTCATTCTCrrT 


ACT 


Rs2836509 


TTTCTAAGTCAATGTAGGCAAC 


ACT 


RS2836510 


CAGCTAGTTATCTTACTTGACC 


ACT 


RS283651 1 


AGCAGGTGACAACCCAGACAT 


ACT 


RS2212601 


TAAGTTTCTGTTGTTTATATGCCA 


ACT 


RS2212602 


CCAGCCAGAGATGGGATCA 


ACG 


RS2226682 


GATTGAACCAGGAAAGAAATAGTT 


CGT 
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AATGCCAGTTGCCATAGGATA 



ATAfliGAAGATGAGTACTATTATTG 



ATTG AG G G AAG AGTAAATG ATTTC 



TGTCTGTGAGTACCCCACAATGAA 



Rs3787919 



TCTGTGGCTTCAATGCTGGG 



ACAGACTTTAAGAAAATCACTGA 



GGGTCATCTGGTTACCTGTCCAA 



TTCCAAAATTCTGGTTGTGTTTT 



CTGCCCCTATACATACATAGCTTC 



AAAAACAATCTGCACAACAAATAT 



GCAGTGAATATGAACAAAAAAAAA 



CAGCTTTAACCTCACTCCAC 



AGTTACAAATCAGGTGGTGGTGG 



GTTACAAATCAGGTGGTGCTG 



TAGGAATCGGAGTCAATAATTTT 



GCTGCACAATGCCCCCCC 



CCTTCTCACTGGGTTCCTG 



TATCACCCCTGTGTCCTGC 



CACAAATAGATTATATATCCTGTT 



AATAAGAAGCAAACACCTTTGCA 



CCACCCCTTCAGAGAGTTG 



TCATATTGGTTGATCGTATTGGTT 



GATTTC AG G AATG AACTATGTTTT 



AGCCGAAAACATAGTTCATTCCTG 



CTTTTGTCTTCTAGGCGTCAG 



AGAACATAAAAGACAGAAATGCA 



Rs2836527 



TTATGTTCTAGCAGGACAGGA 



AAAAGGATGTGCAGATCGCAT 



ATCTGCACATCCTTTTCAGCTT 



CTACCATTCATTGAGTACTTCAG 



CTTCAAAATGTGGGTTGATACC 



GGTCAGAACATGCTGCTTTAT 



Rs3761366 



GTGATGGCTTGTAAAAATGTAAA 



GCATTTGTTACTGCAAAGAGCCAT 



AGCCTTGGAAAATGTCTCAAG 



CACACCCATTCCAACCCAAT 



RS2836534 



GCTGAAGGTTTCTGGGAGCA 



GAGGAGTTGAGTGTTGGAACCA 



ATGGGTACAGGAGGAGTTGA 



CACCCACCCCAATCACCC 



CTTGGTTATCCTTCAGTTTCCA 



GTCATTTAGTTTATGTCTTGGTTG 



GCTCATACGCCCTTGGTCTCTAAT 



AGCTTACGTAATTCAAATCAAGT 
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dbSNP 
rs# 


Extend 
Primer 


Term 
Mix 


RS2836539 


TTACACATTTGCACAATGAGGATA 


CGT 


RS2836540 


GTATCAGTGTTGAATGACTGGT 


ACT 


RS2836541 


TGACACGTTTGTGAATTGCTGAAG 


ACT 


RS2836542 


CCATTTCCTACTGAAGAAGTCCA 


ACT 


RS2836543 


CTTCTTCAGTAGGAAATGGCT 


ACG 


rs881837 


GGCTCTTGAGGCCATGCC 


ACG 


RS3949052 


ACAATTTCTCATGTTGTAAGGATT 


ACG 


Rs2065307 


GGAAACAGATGCCATTTACAATTT 


ACG 


RS3216105 


GCCTGGCTAATTTTTAAAAAAAAA 


CGT 


RS2073427 


CTGCCCCCACATGACCCA 


ACG 



Genetic Analysis 

[0254] Allelotyping results from the discovery cohort are shown for cases and controls in Table 31 . 
The allele frequency for the A2 allele is noted in the fifth and sixth columns for osteoarthritis case pools 
and control pools, respectively, where "AF" is allele fi-equency. The allele frequency for the Al allele 
can be easily calculated by subtracting the A2 allele frequency from 1 (Al AF = 1-A2 AF). For example, 
the SNP rs2898353 has the following case and control allele frequencies: case Al (A) = 0.79; case A2 
(T) = 0.21; control Al (A) = 0.81; and confrol A2 (T) = 0.19, where the nucleotide is provided in 
paranthesis. Some SNPs are labeled "untyped" because of failed assays. 



TABLE 31 



dbSNP 

rs# 


Position in 
SEQ ID NO: 
4 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs2898353 


231 


38783681 


PJJ 


0.21 


0.19 


0.560 


rs960818 


882 


38784332 


A/G 


0.59 


0.57 


0.330 


rs960819 


960 


38784410 


A/C 


0.13 


0.09 


0.101 


rs2410034 


1194 


38784644 


A/C 








rs2836437 


1530 


38784980 


MG 


0.14 


0.14 


0.956 


rs2836438 


1673 


38785123 


A/G 


0.79 


0.75 


0.077 


rs2836439 


2096 


38785546 


C/T 


0.70 


0.71 


0.508 


rs2836440 


2285 


38785735 


MG 


0.19 


0.18 


0.623 


rs2226683 


5873 


38789323 


C/T 


0.79 


0.76 


0.312 


rs2836441 


7256 


38790706 


A/G 


0.12 


0.12 


0.765 


rs2836442 


7988 


38791438 


A/G 


0.31 


0.30 


0.746 


rs2836443 


8222 


38791672 


G/T 


0.22 


0.23 


0.728 


rs2836444 


8381 


38791831 


C/T 


0.19 


0.20 


0.807 


rs3787906 


8814 


38792264 


C/T 


0.97 


untyped 


NA 


rs3838108 


8915 


38792365 


-/C 


0.58 


0.56 


0.425 


rs2836445 


9642 


38793092 


A/G 


0.32 


0.35 


0.190 


rs2836446 


9902 


38793352 


An- 


0.12 


0.14 


0.274 


rs3787908 


10619 


38794069 


A/G 








rs2836447 


10927 


38794377 


C/T 


0.68 


0.67 


0.816 


rs2836448 


11032 


38794482 


C/T 


0.12 


0.14 


0.235 


rs2836450 


14377 


38797827 


C/T 


0.70 


0.68 


0.460 


rs2836451 


15608 


38799058 


C/T 


0.92 


0.95 


0.157 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
4 


Chromosome 
Position 


A1/A2 
Allele 


F A2 


F A2 
Control AT 


F p- 
Value 


rs1015022 


15928 


38799378 


C/G 


0.31 


0.36 


0.072 


rs2836452 


16296 


38799746 


A/G 


0.18 


0.18 


0.822 


rs2836453 


17598 


38801048 


fiJT 


0.02 


0.02 


0.836 


rs3787909 


19272 


38802722 


A/G 


0.06 


0.03 


0.091 


rs2836454 


20084 


38803534 


A/G 


0.04 


0.03 


0.397 


rs2836455 


20577 


38804027 


MT 


0.17 


0.13 


0.050 


rs2155718 


28051 


38811501 


A/G 


0.78 


0.78 


0.950 


rs2836456 


29466 


38812916 


A/G 


0.94 


0.92 


0.569 


rs2836457 


29530 


38812980 


C/T 








rs2836458 


29987 


38813437 


A/G 


0.48 


0.46 


0.455 


rs2032323 


30012 


38813462 


C/T 








rs2051400 


30322 


38813772 


GfT 


0.03 


NA 


NA 


rs2836459 


32216 


38815666 


C/T 


0.19 


0.17 


0.319 


rs2836460 


32516 


38815966 


C/T 








rs2836461 


32544 


38815994 


A/G 








rs2836462 


32746 


38816196 


A/G 








rs2836463 


33137 


38816587 


GfT 


0.67 


0.72 


0.032 


rs2836464 


33538 


38816988 


A/G 


0.67 


0.67 


0.991 


rs2836465 


33798 


38817248 


C/T 








rs2836466 


33802 


38817252 


A/C 


0.39 


0.40 


0.627 


rs2836467 


33964 


38817414 


C/T 








rs3827204 


34132 


38817582 


A/G 


0.45 


0.42 


0.213 


rs2836468 


34210 


38817660 


C/T 


0.13 


0.14 


0.678 


rs3787911 


34317 


38817767 


A/G 


0.13 


0.12 


0.862 


rs2836469 


34499 


38817949 


err 


0.38 


0.40 


0.250 


rs2836470 


34753 


38818203 


A/C 


0.73 


0.74 


0.939 


rs22 12599 


34845 


38818295 


C/T 


0.66 


0.64 


0.474 


rs2836472 


35335 


38818785 


C/T 


0.40 


0.35 


0.071 


rs2836473 


36423 


38819873 


C/T 


0.53 


0.54 


0.755 


rs 1888469 


36450 


38819900 


/VG 


0.45 


0.49 


0.175 


rsi 888470 


36481 


38819931 


GfT 


0.17 


0.18 


0.623 


rs2032322 


38447 


38821897 


C/G 


0.50 


0.50 


0.879 


rs24 10035 


38784 


38822234 


C/T 








rs1 573332 


39387 


38822837 


/VT 


0.57 


0.58 


0.609 


rs2836474 


39458 


38822908 


C/T 


0.33 


0.35 


0.564 


rs2836475 


39822 


38823272 


C/G 


0.17 


0.14 


0.113 


rs3787914 


40305 


38823755 


C/G 


0.73 


0.73 


0.987 


rsi 888471 


40869 


38824319 


C/T 


0.29 


0.26 


0,175 


rs1 888472 


40926 


38824376 


C/T 


0.62 


0.63 


0.818 


rs1 888473 


41010 


38824460 


C/T 


0.63 


0.65 


0.435 


rs1 888474 


41134 


38824584 


C/T 


0.28 


0.23 


0.099 


rs2836476 


41984 


38825434 


A/G 


0.46 


0.44 


0.379 


rs3787916 


42172 


38825622 


/VT 


0.45 


0.43 


0.314 


rs2836477 


42753 


38826203 


G/T 


0.94 


0.96 


0.196 


rs970043 


43011 


38826461 


C/T 


0,04 


0.04 


0.549 


rs22 12600 


43176 


38826626 


A/G 








rs2836478 


43320 


38826770 


GfT 


0.76 


0.75 


0.914 


rs2836479 




38826831 


A/T 


0.44 


0.43 


0.670 


rsi 475877 


44142 


38827592 


A/G 


0.35 


0.32 


0.110 


rs2836480 


44383 


38827833 


A/G 


0.46 


0.43 


0.153 


rs2836481 


44726 


38828176 


C/T 


0.42 


0.40 


0.434 


rs2836483 


45087 


38828537 


A/G 


0.47 


0.45 


0.393 


rs2836484 


45141 


38828591 


C/T 


0.46 


0.47 


0.671 


rs2836485 


45359 


38828809 


C/G 


0.16 


0.17 


0.643 


rs2836486 


45421 


38828871 


C/T 








rs2836487 


45456 


38828906 


C/T 


0.02 


0.03 


0.758 


rs1893199 


45467 


38828917 


C/T 


0.62 


0.65 


0.220 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
4 


Chromosome 
Position 


A1/A2 
Allele 


F A2 


F A2 
Control AF 


F p- 


rs2836488 


45486 


38828936 


CfT 


0.25 


0.23 


0.360 


rsl 893200 


45709 


38829159 


A/G 


0.16 


0.14 


0.177 


rs1 893201 


45716 


38829166 


A/G 


0.84 


0.87 


0.060 


rs2836489 


47626 


38831076 


C/T 


0.29 


0.31 


0.502 


rs1888475 


49413 


38832863 


A/G 








rs2836490 


49796 


38833246 


err 


0.94 


0.93 


0.731 


rs2836491 


49962 


38833412 


A/G 


0.10 


0.08 


0.219 


rs2836492 


50075 


38833525 


err 


0.20 


0.22 


0.518 


rs2836493 


50093 


38833543 


A/G 


0.95 


0.94 


0.850 


rs2836494 


50571 


38834021 


err 


0.72 


0.70 


0.536 


rs2836495 


50615 


38834065 


A/G 


0.82 


0.78 


0.142 


rs2898354 


50780 


38834230 


A/G 


0.25 


0.25 


0.728 


rs3065390 


50851 


38834301 


-/TA 


0.10 


0.11 


0.845 


rs2836496 


51459 


38834909 


A/C 


0.80 


0.84 


0.064 


rs2836497 


53193 


38836643 


C/T 


0.65 


0.65 


0.935 


rs2836498 


53702 


38837152 


C/T 


0.43 


0.44 


0.682 


rs2836499 


53736 


38837186 


A/C 


0.33 


0.30 


0.169 


rs2836500 


53795 


38837245 


CAT 








rs2836501 


54109 


38837559 


A/T 


0.36 


0.34 


0.234 


rs2836502 


54126 


38837576 


C/T 


0.31 


0.29 


0.427 


rs2836503 


54230 


38837680 


fiJC 


0.32 


0.29 


0.194 


rs2836504 


54894 


38838344 


C/T 


0.51 


0.54 


0.170 


rs3787917 


55455 


38838905 


A/G 


0.56 


0.60 


0.137 


rs2836505 


55499 


38838949 


A/G 


0.73 


0.78 


0.022 


rs2836506 


56522 


38839972 


C/T 


0.52 


0.56 


0.145 


rs2836507 


56662 


38840112 


C/T 


0.51 


0.54 


0.173 


rs2836508 


56954 


38840404 


A/G 


0.53 


0.56 


0.376 


rs2836509 


57267 


38840717 


A/G 


0.35 


0.31 


0.089 


rs2836510 


58282 


38841732 


A/G 


0.65 


0.59 


0.034 


rs2836511 


58916 


38842366 


A/C 


0.32 


0.30 


0.315 


rs22 12601 


59544 


38842994 


C/G 


0.45 


0.46 


0.568 


rs22 12602 


59666 


38843116 


C/T 


0.30 


0.28 


0.644 


rs2226682 


59913 


38843363 


A/T 


0.38 


0.35 


0.164 


rs2836512 


66846 


38850296 


A/G 


0.94 


0.94 


0.896 


rs2836513 


67245 


38850695 


G/T 


0.23 


0.22 


0.713 


rs1 999328 


67652 


38851102 


A/C 


0.79 


0.79 


0.973 


rs2212603 


67955 


38851405 


A/G 


0.73 


0.72 


0.776 


rs3787919 


67966 


38851416 


A'C 








rs2836514 


68420 


38851870 


A/G 


0.52 


0.54 


0.319 


rsl 0231 53 


70226 


38853676 


A/G 


0.09 


0.09 


0.985 


rs1 023372 


70810 


38854260 


C/T 


0.83 


0.81 


0.518 


rs22 12604 


72246 


38855696 


A/G 


0.68 


0.71 


0.237 


rs2226684 


73330 


38856780 


G/T 


0.83 


0.81 


0.462 


rs22 12605 


73457 


38856907 


err 


0.82 


0.85 


0.255 


rs2 187307 


74389 


38857839 


A/G 


0.13 


0.13 


0.869 


rs3065412 


74638 


38858088 


-lAA 








rs2898355 


74640 


38858090 


A/C 


0.96 


0.94 


0.413 


rs2836518 




38858808 


A/C 


0.10 


0.12 


0.261 


rs3838110 


75952 


38859402 


-/G 


0.66 


0.67 


0.790 


rs2836519 


76098 


38859548 


A/G 


0.60 


0.61 


0.509 


rs3827207 


77836 


38861286 


A/G 


0.62 


0.63 


0.575 


rs2836520 


78449 


38861899 


A/C 








rs2836521 


78507 


38861957 


G/T 


0.07 


0.08 


0.551 


rs2836522 


80031 


38863481 


G/T 


0.11 


0.08 


0.155 


rs2836523 


81695 


38865145 


C/T 








rs2836524 


82775 


38866225 


A/G 


0.05 


0.04 


0.321 


rs2836525 


82795 


38866245 


A/G 


0.11 


0.11 


0.875 
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dbSNP 


Position in 
SEQ ID NO: 
4 


Chr 


A1/A2 


F A2 


F A1 


b p- 


rs# 


Po^t^on""* 


Allele 


Case AF 


on ro 


Value 


rs3833350 


84611 


38868061 


-/C 








rs2836526 


84657 


38868107 


C/T 


0.83 


0.86 


0.292 


rs2836527 


84693 


38868143 


A/C 


0.08 


0.08 


0.936 


rs3834676 


85020 


38868470 


-n 


0.80 


0.83 


0.191 


rs2836528 


85048 


38868498 


err 


0.84 


0.87 


0.089 


rs3761364 


85100 


38868550 


CfT 


0.06 


0.04 


0.159 


rs2836529 


85325 


38868775 


A/C 


0.09 


0.06 


0.100 


rs2836530 


85452 


38868902 


CfT 








rs3761366 


85868 


38869318 


A/G 


0.06 


0.04 


0.179 


rs2836531 


85936 


38869386 


A/G 


0.49 


0.50 


0.729 


rs2836532 


85990 


38869440 


fiJT 


0.30 


0.29 


0.766 


rs2836533 


86139 


38869589 


C/T 


0.47 


0.48 


0.751 


rs2836534 


86497 


38869947 


CfT 


0.87 


0.87 


0.874 


rs2836535 


87236 


38870686 


A/G 


0.93 


0.92 


0.628 


rs2836536 


87248 


38870698 


CfT 


0.86 


0.84 


0.474 


rs3827208 


87533 


38870983 


C/G 


0.51 


0.53 


0.459 


rs715860 


87912 


38871362 


A/G 


0.08 


0.09 


0.627 


rs717231 


88108 


38871558 


GfT 


0.65 


0.67 


0.382 


rs2836537 


88494 


38871944 


A/C 


0.43 


0.40 


0.239 


rs2836538 


89598 


38873048 


A/C 








rs2836539 


90235 


38873685 


fiJT 


0.98 


0.97 


0.796 


rs2836540 


91287 


38874737 


GfT 








rs2836541 


91359 


38874809 


CfT 


0.07 


0.06 


0.403 


rs2836542 


92384 


38875834 


A/C 


0.36 


0.38 


0.418 


rs2836543 


92410 


38875860 


CfT 


0,54 


0.50 


0.202 


rs881837 


92900 


38876350 


CfT 


0.29 


0.28 


0.639 


rs3949052 


94495 


38877945 


A/G 








rs2065307 


94512 


38877962 


A/G 








rs3216105 


97777 


38881227 


-/A 


0.32 


0.28 


0.265 


rs2073427 


98333 


38881783 


C/T 


0.09 


0.07 


0.242 



[0255] The ERG proximal SNPs were also allelotyped in the replication cohorts using the methods 
described herein and the primers provided in Tables 29 and 30. The repUcation allelotyping results for 
rephcation cohort #1 and replication cohort #2 are provided in Tables 32 and 33, respectively. 



TABLE 32 



dbSNP 
rs# 


Position in 
SEQ ID NO: 
4 


Chromosome 
Position 


A1/A2 
AUele 


F A2 

Case AF 


F A2 
Control AF 


Fp- 
Value 


rs2898353 


231 


38783681 


fiJT 


0.19 


0.19 


0.773 


rs960818 


882 


38784332 


A/G 


0.59 


0.57 


0.600 


rs960819 


960 


38784410 


A/C 


0.07 


NA 


0.132 


rs2410034 


1194 


38784644 


MC 








rs2836437 


1530 


38784980 


A/G 


0.14 


0.14 


0.957 


rs2836438 


1673 


38785123 


A/G 


0.80 


0.77 


0.402 


rs2 836439 


2096 


38785546 


C/T 


0.68 


0.73 


0.089 


rs2836440 


2285 


38785735 


A/G 


0.20 


0.18 


0.421 


rs2226683 


5873 


38789323 


C/T 


0.78 


0.76 


0.622 


rs2836441 


7256 


38790706 


A/G 


0.12 


0.12 


0.946 


rs2836442 


7988 


38791438 


A/G 


0.30 


0.32 


0.674 


rs2836443 


8222 


38791672 


GfT 


0.22 


0.25 


0.332 


rs2836444 


8381 


38791831 


CfT 


0.20 


0.20 


0.908 


rs3787906 


8814 


38792264 


CfT 


0.97 


untyped 


NA 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
4 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F p- 
Value 


rs3838108 


8915 


38792365 


-/C 


0.58 


0.56 


0.604 


rs2836445 


9642 


38793092 


A/G 


0.33 


0.37 


0.211 


rs2836446 


9902 


38793352 


A/T 


0.13 


0.15 


0.481 


rs3787908 


10619 


38794069 


A/G 








rs2836447 


10927 


38794377 


C/T 


0.67 


0.67 


0.843 


rs2836448 


11032 


38794482 


err 


0.13 


0.15 


0.521 


rs2836450 


14377 


38797827 


C/T 


0.67 


0.67 


0.989 


rs2836451 


15608 


38799058 


C/T 


0.92 


0.95 


0.214 


rs1 015022 


15928 


38799378 


C/G 


0.30 


0.36 


0.076 


rs2836452 


16296 


38799746 


A/G 


0.18 


0.18 


0.982 


rs2836453 


17598 


38801048 


A/T 


0.02 


untyped 


NA 


rs3787909 


19272 


38802722 


A/G 


0.06 


0.03 


0.110 


rs2836454 


20084 


38803534 


A/G 


0.03 


0.03 


0.746 


rs2836455 


20577 


38804027 


A/T 


0.17 


0.12 


0.080 


rs2155718 


28051 


38811501 


A/G 


0.78 


0.79 


0.747 


rs2836456 


29466 


38812916 


A/G 


0.91 


0.91 


0.915 


rs2836457 


29530 


38812980 


C/T 








rs2836458 


29987 


38813437 


A/G 


0.48 


0.47 


0.626 


rs2032323 


30012 


38813462 


CfT 








rs2051400 


30322 


38813772 


G/T 


0.02 


untyped 


NA 


rs2836459 


32216 


38815666 


C/T 


0.20 


0.16 


0.278 


rs2836460 


32516 


38815966 


C/T 








rs2836461 


32544 


38815994 


A/G 








rs2836462 


32746 


38816196 


A/G 








rs2836463 


33137 


38816587 


G/T 


0.67 


0.75 


0.011 


rs2836464 


33538 


38816988 


A/G 


0.66 


0.68 


0.586 


rs2836465 


33798 


38817248 


C/T 








rs2836466 


33802 


38817252 


A/C 


0.39 


0.41 


0.507 


rs2836467 


33964 


38817414 


C/T 








rs3827204 


34132 


38817582 


A/G 


0.45 


0.41 


0.229 


rs2836468 


34210 


38817660 


C/T 


0.13 


0.14 


0.736 


rs3787911 


34317 


38817767 


A/G 


0.14 


0.13 


0.856 


rs2836469 


34499 


38817949 


C/T 


0.37 


0.41 


0.168 


rs2836470 


34753 


38818203 


A/C 


0.72 


0.73 


0.854 


rs22 12599 


34845 


38818295 


err 


0.63 


0.65 


0.636 


rs2836472 


35335 


38818785 


C/T 


0.41 


0.35 


0.145 


rs2836473 


36423 


38819873 


C/T 


0.51 


0.54 


0.291 


rsl 888469 


36450 


38819900 


A/G 


0.45 


0.49 


0.281 


rsl 888470 


36481 


38819931 


G/T 


0.17 


0.17 


0.949 


rs2032322 


38447 


38821897 


C/G 


0.51 


0.53 


0.476 


rs24 10035 


38784 


38822234 


C/T 








rsl 573332 


39387 


38822837 


AfT 


0.56 


0.60 


0.279 


rs2836474 


39458 


38822908 


C/T 


0.33 


0.36 


0.330 


rs2836475 


39822 


38823272 


C/G 


0.18 


0.13 


0.049 


rs3787914 


40305 


38823755 


C/G 


0.73 


0.74 


0.977 


rsl 888471 


40869 


38824319 


C/T 


0.31 


0.26 


0.134 


rsl 888472 


40926 


38824376 


C/T 


0.62 


0.65 


0.247 


rsl 888473 


41010 


38824460 


C/T 


0.63 


0.67 


0.210 


rs1 888474 


41134 


38824584 


C/T 


0.28 


0.21 


0.091 


rs2836476 


41984 


38825434 


A/G 


0.47 


0.44 


0.346 


rs3787916 


42172 


38825622 


AfT 


0.46 


0.41 


0.171 


rs2836477 


42753 


38826203 


G/T 


0.94 


0.97 


0.294 


rs970043 


43011 


38826461 


C/T 


0.05 


0.03 


0.331 


rs22 12600 


43176 


38826626 


A/G 








rs2836478 


43320 


38826770 


G/T 


0.75 


0.75 


0.983 


rs2836479 


43381 


38826831 


A/T 


0.44 


0.43 


0.752 


rsl 475877 


44142 


38827592 


A/G 


0.35 


0.31 


0.166 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
4 


Chromosome 
Position 


A1/A2 
Allele 


F A2 


F A2 
Control AF 


F p- 
Value 


rs2836480 


44383 


38827833 


A/G 


0.45 


0.41 


0.254 


rs2836481 


44726 


38828176 


C/T 


0.42 


0.39 


0.330 


rs2836483 


45087 


38828537 


A/G 


0.46 


0.46 


0.797 


rs2836484 


45141 


38828591 


C/T 


0.45 


0.47 


0.553 


rs2836485 


45359 


38828809 


C/G 


0.18 


0.18 


0.993 


rs2836486 


45421 


38828871 


C/T 








rs2836487 


45456 


38828906 


C/T 


0.03 


0.03 


0.955 


rs1893199 


45467 


38828917 


C/T 


0.61 


0.67 


0.071 


rs2836488 


45486 


38828936 


C/T 


0.27 


0.23 


0.246 


rs1 893200 


45709 


38829159 


A/G 


0.16 


0.13 


0.203 


rsl 893201 


45716 


38829166 


A/G 


0.83 


0.89 


0.021 


rs2836489 


47626 


38831076 


C/T 


0.30 


0.31 


0.702 


rs1888475 


49413 


38832863 


A/G 








rs2836490 


49796 


38833246 


C/T 


0.94 


0.95 


0.662 


rs2836491 


49962 


38833412 


A/G 


0.10 


0.06 


0.038 


rs2836492 


50075 


38833525 


C/T 


0.20 


0.22 


0.651 


rs2836493 


50093 


38833543 


A/G 


0.93 


0.95 


0.397 


rs2836494 


50571 


38834021 


C/T 


0.73 


0.71 


0.592 


rs2836495 


50615 


38834065 


A/G 


0.81 


0.77 


0.212 


rs2898354 


50780 


38834230 


A/G 


0.24 


0.24 


0.827 


rs3065390 


50851 


38834301 


-/TA 


0.10 


0.11 


0.743 


rs2836496 


51459 


38834909 


A/C 


0.78 


0.86 


0.022 


rs2836497 


53193 


38836643 


err 


0.65 


0.66 


0.733 


rs2836498 


53702 


38837152 


C/T 


0.44 


0.46 


0.576 


rs28364g9 


53736 


38837186 


A/C 


0.33 


0.29 


0.200 


rs2836500 


53795 


38837245 


CfT 








rs2836501 


54109 


38837559 


fiJT 


0.36 


0.32 


0.167 


rs28365Q2 


54126 


38837576 


C/T 


0.31 


0.27 


0.206 


rs2836503 


54230 


38837680 


MC 


0.32 


0.28 


0.173 


rs2836504 


54894 


38838344 


err 


0.50 


0.57 


0.033 


rs3787917 


55455 


38838905 


A/G 


0.56 


0.62 


0.033 


rs2836505 


55499 


38838949 


A/G 


0.72 


0.81 


0.004 


rs2836506 


56522 


38839972 


c/r 


0.52 


0.58 


0.093 


rs2836507 


56662 


38840112 


C/T 


0.51 


0.56 


0.134 


rs2836508 


56954 


38840404 


A/G 


0.53 


0.58 


0.170 


rs2836509 


57267 


38840717 


NG 


0.35 


0.30 


0.136 


rs2836510 


58282 


38841732 


A/G 


0.62 


0.56 


0.035 


rs283651 1 


58916 


38842366 


AJC 


0.33 


0.30 


0.273 


rs2212601 


59544 


38842994 


C/G 


0.44 


0.46 


0.675 


rs22 12602 


59666 


38843116 


C/T 


0.29 


0.27 


0.571 


rs2226682 


59913 


38843363 


A/T 


0.38 


0.33 


0.127 


rs2836512 


66846 


38850296 


AJG 


0.93 


0.96 


0.261 


rs2836513 


67245 


38850695 


G/T 


0.23 


0.22 


0.692 


rs1 999328 


67652 


38851102 


AJC 


0.79 


0.80 


0.618 


rs22 12603 


67955 


38851405 


A/G 


0.73 


0.74 


0.676 


rs3787919 


67966 


38851416 


AJC 








rs2836514 


68420 


38851870 


AJG 


0.51 


0.57 


0.044 


rs1023153 




38853676 


A/G 


0.09 


0.09 


0.699 


rs1 023372 


70810 


38854260 


C/T 


0.82 


untyped 


NA 


rs2212604 


72246 


38855696 


A/G 


0.67 


0.73 


0.063 


rs2226684 


73330 


38856780 


G/T 


0.82 


0.82 


0.992 


rs22 12605 


73457 


38856907 


C/T 


0.83 


0.86 


0.180 


rs2 187307 


74389 


38857839 


A/G 


0.14 


0.13 


0.901 


rs3065412 


74638 


38858088 


-lAA 








rs2898355 


74640 


38858090 


AJC 


0.95 


0.93 


0.442 


rs2836518 


75358 


38858808 


A/C 


0.11 


0.14 


0.248 


rs3838110 


75952 


38859402 


-/G 


0.65 


0.68 


0.399 
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Position in 
SEQ ID NO: 
4 


Chromosome 


A1/A1 


F Al 


F A2 


t p- 


rs# 


Position 


All 

e e 


ase 


Oontroi Ah 


Value 


rs2836519 


76098 


38859548 


A/G 


0.59 


0.64 


0.134 


rs3827207 


77836 


38861286 


A/G 


0.60 


0.64 


0.205 


rs2836520 


78449 


38861899 


A/C 








rs2836521 


78507 


38861957 


G/r 


0.08 


0.09 


0.765 


rs2836522 


80031 


38863481 


GifT 


0.12 


0.07 


0.033 


rs2836523 


81695 


38865145 


CfT 








rs2836524 


82775 


38866225 


A/G 


0.05 


0.04 


0.539 


rs2836525 


82795 


38866245 


A/G 


0.12 


0.09 


0.179 


rs3833350 


84611 


38868061 


-/C 








rs2836526 


84657 


38868107 


C/T 


0.83 


0.85 


0.536 


rs2836527 


84693 


38868143 


A/C 


0.08 


0.07 


0.444 


rs3834676 


85020 


38868470 


-rr 


0.79 


0.82 


0.270 


rs2836528 


85048 


38868498 


CfT 


0.82 


0.86 


0.130 


rs3761364 


85100 


38868550 


C/T 


0.08 


0.05 


0.132 


rs2836529 


85325 


38868775 


A/C 


0.09 


0.07 


0.214 


rs2836530 


85452 


38868902 


C/T 








rs3761366 


85868 


38869318 


A/G 


0.07 


0.04 


0.259 


rs2836531 


85936 


38869386 


A/G 


0.49 


0.50 


0.741 


rs2836532 


85990 


38869440 


A/T 


0.30 


0.30 


0.921 


rs2836533 


86139 


38869589 


CfT 


0.48 


0.48 


0.843 


rs2836534 


86497 


38869947 


CfT 


0.86 


0.89 


0.374 


rs2836535 


87236 


38870686 


A/G 


0.91 


0.91 


. 0.933 


rs2836536 


87248 


38870698 


C/T 


0.86 


0.86 


0.945 


rs3827208 


87533 


38870983 


C/G 


0.51 


0.55 


0.183 


rs7 15860 


87912 


38871362 


A/G 


0.07 


0.07 


0.893 


rs717231 


88108 


38871558 


G/T 


0.65 


0.68 


0.506 


rs2836537 


88494 


38871944 


A/C 


0.43 


0.39 


0.251 


rs2836538 


89598 


38873048 


A/C 








rs2836539 


90235 


38873685 


ATT 


0.98 


0.98 


0.910 


rs2836540 


91287 


38874737 


G/T 








rs2B36541 


91359 


38874809 


CfT 


0.09 


0.06 


0.324 


rs2836542 


92384 


38875834 


A/C 


0.37 


0.41 


0.365 


rs2836543 


92410 


38875860 


CfT 


0.54 


0.55 


0.863 


rs881837 


92900 


38876350 


C/T 


0.30 


0.28 


0.673 


rs3949052 


94495 


38877945 


A/G 








rs2065307 


94512 


38877962 


A/G 








rs3216105 


97777 


38881227 


-/A 


0.31 


0.29 


0.603 


rs2073427 


98333 


38881783 


CfT 


0.09 


0.06 


0.249 



TABLE 33 



dbSNP 
rs# 


Position in 
SEQ ID NO: 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs2898353 


231 


38783681 


AfT 


0.22 


0.21 


0.629 


rs960818 


882 


38784332 


A/G 


0.59 


0.55 


0.351 


rs960819 


960 


38784410 


A/C 


0.12 


0.01 




rs2410034 


1194 


38784644 


A/C 








rs2836437 


1530 


38784980 


AJC 


0.14 


0.14 


0.989 


rs2836438 


1673 


38785123 


A/G 


0.78 


0.71 


0.047 


rs2836439 


2096 


38785546 


C/T 


0.72 


0.68 


0.265 


rs2836440 


2285 


38785735 


A/G 


0.18 


0.19 


0.789 


rs2226683 


5873 


38789323 


CfT 


0.80 


0.77 


0.342 


rs2836441 


7256 


38790706 


A/G 


0.11 


0.12 


0.559 


rs2836442 


7988 


38791438 


A/G 


0.32 


0.28 


0.269 


irs283e443 


8222 


38791672 


G/T 


0.23 


0.21 


0.504 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
4 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control At 


* p- 
Value 


rs2836444 


8381 


38791831 


C/T 


0.19 


0.19 


0.829 


rs3787906 


8814 


38792264 


C/T 


0.97 


untyped 




rs3838108 


8915 


38792365 


-/C 


0.58 


0.55 


0.526 


rs2836445 


9642 


38793092 


A/G 


0.30 


0.32 


0.722 


rs2836446 


9902 


38793352 


ATT 


0.11 


0.14 


0.425 


rs3787908 


10619 


38794069 


A/G 








rs2836447 


10927 


38794377 


err 


0.68 


0.68 


0.908 


rs2836448 


11032 


38794482 


C/T 


0.11 


0.14 


0.302 


rs2836450 


14377 


38797827 


C/T 


0.73 


0.70 


0.314 


rs2836451 


15608 


38799058 


CAT 


0.93 


0.94 


0.499 


rs1015022 


15928 


38799378 


C/G 


0.33 


0.35 


0.527 


rs2836452 


16296 


38799746 


A/G 


0.17 


0.18 


0.750 


rs2836453 


17598 


38801048 


AH" 


0.02 


0.02 


0.934 


rs3787909 


19272 


38802722 


A/G 


0.05 


0.04 


0.546 


rs2836454 


20084 


38803534 


A/G 


0.05 


0.03 


0.379 


rs2836455 


20577 


38804027 


A/T 


0.17 


0.15 


0.472 


rs2155718 


28051 


38811501 


A/G 


0.79 


0.78 


0.704 


rs2836456 


29466 


38812916 


A/G 


0.97 


0.94 


0.174 


rs2836457 


29530 


38812980 


C/T 








rs2836458 


29987 


38813437 


A/G 


0.48 


0.45 


0.532 


rs2032323 


30012 


38813462 


C/T 








rs2051400 


30322 


38813772 


G/T 


0.04 


0.02 


0.476 


rs2836459 


32216 


38815666 


C/T 


0.19 


0.18 


0.921 


rs2836460 


32516 


38815966 


CAT 








rs2836461 


32544 


38815994 


A/G 








rs2836462 


32746 


38816196 


A/G 








rs2836463 


33137 


38816587 


G/T 


0.68 


0.68 


0.988 


rs2836464 


33538 


38816988 


A/G 


0.69 


0.66 


0.430 


rs2836465 


33798 


38817248 


C/T 








rs2836466 


33802 


38817252 


A/C 


0.39 


0.39 


0.948 


rs2836467 


33964 


38817414 


C/T 








rs3827204 


34132 


38817582 


A/G 


0.45 


0.43 


0.614 


rs2836468 


34210 


38817660 


C/T 


0.12 


0.12 


0.879 


rs378791 1 


34317 


38817767 


A/G 


0.12 


0.11 


0.901 


rs2836469 


34499 


38817949 


C/T 


0.38 


0.39 


0.914 


rs2836470 


34753 


38818203 


A/C 


0.75 


0.74 


0.960 


rs22 12599 


34845 


38818295 


C/T 


0.71 


0.64 


0.095 


rs2836472 


35335 


38818785 


C/T 


0.40 


0.36 


0.321 


rs2836473 


36423 


38819873 


C/T 


0.56 


0.53 


0.433 


rs 1888469 


36450 


38819900 


A/G 


0.45 


0.49 


0.399 


rs 1888470 


36481 


38819931 


G/T 


0.16 


0.19 


0.356 


rs2032322 


38447 


38821897 


C/G 


0.50 


0.45 


0.190 


rs24 10035 


38784 


38822234 


C/T 








rs 1573332 


39387 


38822837 


ATT 


0.58 


0.56 


0.554 


rs2836474 


39458 


38822908 


C/T 


0.34 


0.33 


0.762 


rs2836475 


39822 


38823272 


C/G 


0.15 


0.14 


0.817 


rs3787914 


40305 


38823755 


C/G 


0.73 


0.73 


0.934 


rsl 888471 


40869 


38824319 


C/T 


0.28 


0.27 


0.760 


rsl 888472 


40926 


38824376 


C/T 


0.63 


0.58 


0.302 


rsl 888473 


41010 


38824460 


C/T 


0.63 


0.62 


0.683 


rsl 888474 


41134 


38824584 


C/T 


0.27 


0.26 


0.853 


rs2836476 


41984 


38825434 


A/G 


0.46 


0.45 


0.838 


rs3787916 


42172 


38825622 


ATT 


0.44 


0.45 


0.827 


rs2836477 


42753 


38826203 


G/T 


0.94 


0.95 


0.505 


rs970043 


43011 


38826461 


err 


0.04 


0.04 


0.848 


rs2212600 


43176 


38826626 


A/G 








rs283647B 


43320 


38826770 


GH- 


0.76 


0.75 


0.893 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
4 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F p- 


rs2836479 


43381 


38826831 


A/T 


0.44 


0.43 


0.801 


rs1 475877 


44142 


38827592 


AJG 


0.35 


0.33 


0.450 


rs2836480 


44383 


38827833 


AJG 


0.47 


0.44 


0.444 


rs2836481 


44726 


38828176 


C/T 


0.41 


0.41 


0.999 


rs2836483 


45087 


38828537 


A/G 


0.48 


0.44 


0.306 


rs2836484 


45141 


38828591 


C/T 


0.46 


0.46 


0.939 


rs2836485 


45359 


38828809 


C/G 


0.15 


0.17 


0.483 


rs2836486 


45421 


38828871 


C/T 








rs2836487 


45456 


38828906 


C/T 


NA 


0.03 


NA 


rs1893199 


45467 


38828917 


C/T 


0.63 


0.62 


0.868 


rs2836488 


45486 


38828936 


C/T 


0.23 


0.22 


0.913 


rs1 893200 


45709 


38829159 


A/G 


0.17 


0.16 


0.653 


rsl 893201 


45716 


38829166 


A/G 


0.85 


0.85 


0.947 


rs2836489 


47626 


38831076 


C/T 


0.27 


0.30 


0.597 


rs 1888475 


49413 


38832863 


A/G 








rs2836490 


49796 


38833246 


C/T 


0.94 


0.91 


0.196 


rs2836491 


49962 


38833412 


A/G 


0.09 


0.11 


0.493 


rs2B36492 


50075 


38833525 


C/T 


0.20 


0.21 


0.669 


rs28364g3 


50093 


38833543 


A/G 


0.96 


0.93 


0.211 


rs2836494 


50571 


38834021 


C/T 


0.70 


0.69 


0.697 


rs2836495 


50615 


38834065 


A/G 


0.82 


0.80 


0.510 


rs2898354 


50780 


38834230 


A/G 


0.27 


0.26 


0.846 


rs3065390 


50851 


38834301 


-fTA 


0.11 


0.10 


0.936 


rs2836496 


51459 


38834909 


A/C 


0.81 


0.80 


0.746 


rs2836497 


53193 


38836643 


C/T 


0.66 


0.64 


0.756 


rs2836498 


53702 


38837152 


C/T 


0.41 


0.40 


0.844 


rs2836499 


53736 


38837186 


A/C 


0.32 


0.30 


0.567 


rs2836500 


53795 


38837245 


C/T 








rs2836501 


54109 


38837559 


A/T 


0.36 


0.36 


0.917 


rs2836502 


54126 


38837576 


cn 


0.31 


0.32 


0.738 


rs2836503 


54230 


38837680 


A/C 


0.32 


0.31 


0.730 


rs2836504 


54894 


38838344 


err 


0.52 


0.50 


0.620 


rs3787917 


55455 


38838905 


A/G 


0.57 


0.56 


0.759 


rs2836505 


55499 


38838949 


A/G 


0.74 


0.74 


0.982 


rs2836506 


56522 


38839972 


C/T 


0.52 


0.53 


0.907 


rs2836507 


56662 


38840112 


C/T 


0.51 


0.52 


0.785 


rs2836508 


56954 


38840404 


A/G 


0.53 


0.52 


0.709 


rs2836509 


57267 


38840717 


AJG 


0.35 


0.33 


0.453 


rs2836510 


58282 


38841732 


A/G 


0.68 


0.65 


0.457 


rs283651 1 


58916 


38842366 


A/C 


0.32 


0.31 


0.832 


rs2212601 


59544 


38842994 


C/G 


0.45 


0.47 


0.717 


rs22 12602 


59666 


38843116 


C/T 


0.30 


0.30 


0.994 


rs2226682 


59913 


38843363 


A/T 


0.39 


0.38 


0.801 


rs2836512 


66846 


38850296 


A/G 


0.94 


0.91 


0.184 


rs2836513 


67245 


38850695 


G/T 


0.23 


0.23 


0.949 


rsl 999328 


67652 


38851102 


A/C 


0.80 


0.77 


0.487 


rs2212603 


67955 


38851405 


A/G 


0.74 


0.70 


0.289 


rs378791 9 




38851416 










rs2836514 


68420 


38851870 


A/G 


0.53 


0.49 


0.363 


rs1023153 


70226 


38853676 


A/G 


0.08 


0.09 


0.611 


rs 1023372 


70810 


38854260 


C/T 


0.84 


0.81 


0.315 


rs22 12604 


72246 


38855696 


A/G 


0.69 


0.68 


0.641 


rs2226684 


73330 


38856780 


G/T 


0.85 


0.81 


0.216 


rs22 12605 


73457 


38856907 


C/T 


0.82 


0.82 


0.927 


rs2 187307 


74389 


38857839 


A/G 


0.12 


0.13 


0.685 


rs3065412 


74638 


38858088 


-/AA 








rs2898355 


74640 


38858090 


A/C 


0.96 


0.96 


0.893 
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rs# 


Position in 
SEQ ID NO: 
4 


PosiHon""* 


A1/A2 

All 


F A2 


F A2 
Control AF 


V f~ 
a ue 


rs2836518 


75358 


38858808 


A/C 


0.10 


0.11 


0.823 


rs3838110 


75952 


38859402 


-/G 


0.68 


0.65 


0.457 


rs2836519 


76098 


38859548 


A/G 


0.60 


0.57 


0.357 


rs3827207 


77836 


38861286 


A/G 


0.64 


0.61 


0.449 


rs2836520 


78449 


38861899 


A/C 








rs2836521 


78507 


38861957 


G/T 


0.06 


0.07 


0.625 


rs2836522 


80031 


38863481 


G/T 


0.09 


0.10 


0.810 


rs2836523 


81695 


38865145 


CfT 








rs2836524 


82775 


38866225 


A/G 


0.05 


0.04 


0.419 


rs2836525 


82795 


38866245 


A/G 


0.10 


0.14 


0.132 


rs3833350 


84611 


38868061 


-/C 








rs2836526 


84657 


38868107 


err 


0.83 


0.86 


0.342 


rs2836527 


84693 


38868143 


A/C 


0.08 


0.11 


0.209 


rs3834676 


85020 


38868470 


-rr 


0.81 


0.84 


0.442 


rs2836528 


85048 


38868498 


err 


0.86 


0.88 


0.350 


rs3761364 


85100 


38868550 


CfT 


0.04 


0.03 


0.643 


rs2836529 


85325 


38868775 


A/C 


0.08 


0.06 


0.271 


rs2836530 


85452 


38868902 


C/T 








rs3761366 


85868 


38869318 


PJG 


0.06 


0.04 


0.473 


rs2836531 


85936 


38869386 


AJG 


0.49 


0.49 


0.915 


rs2836532 


85990 


38869440 


ATT 


0.31 


0.28 


0.446 


rs2836533 


86139 


38869589 


C/T 


047 


0.48 


0.810 


rs2836534 


86497 


38869947 


CfT 


0.88 


0.84 


0.149 


rs2836535 


87236 


38870686 


A/G 


0.94 


0.92 


0.378 


rs2836536 


87248 


38870698 


C/T 


0.86 


0.82 


0.311 


rs3827208 


87533 


38870983 


C/G 


0.51 


0.49 


0.598 


rs715860 


87912 


38871362 


AJG 


0.09 


0.11 


0.463 


rs7 17231 


88108 


38871558 


G/T 


0.65 


0.67 


0.588 


rs2836537 


88494 


38871944 


A/C 


0.42 


0.41 


0.694 


rs2836538 


89598 


38873048 


A/C 








rs2836539 


90235 


38873685 


A/T 


0.97 


0.97 


0.749 


re2836540 


91287 


38874737 


G/T 








rs2836541 


91359 


38874809 


C/T 


0.05 


0.05 


0.895 


rs2836542 


92384 


38875834 


A/C 


0.34 


0.34 


0.998 


rs2836543 


92410 


38875860 


C/T 


untyped 


0.43 


NA 


rs881837 


92900 


38876350 


C/T 


0.29 


0.28 


0.811 


rs3949052 


94495 


38877945 


A/G 








rs2065307 


94512 


38877962 


A/G 








rs3216105 


97777 


38881227 


-/A 


0.32 


0.28 


0.273 


rs2073427 


98333 


38881783 


C/T 


0.08 


0.07 


0.700 



[0256] Allelotyping results were considered particularly significant with a calculated p-value of less 
than or equal to 0.05 for allelotype results. These values are indicated in bold. The allelotyping p-values 
were plotted in Figure ID for the discovery cohort. The position of each SNP on the chromosome is 
presented on the x-axis. The y-axis gives the negative logarithm (base 10) of the p-value comparing the 
estimated allele in the case group to that of the control group. The minor allele frequency of the control 
group for each SNP designated by an X or other symbol on the graphs in Figure ID can be determined by 
consulting Table 3 1 . For example, the left-most X on the left graph is at position 3878368 1 . By 
proceeding down the Table from top to bottom and across the graphs from left to right the allele 
frequency associated with each symbol shown can be determined. 
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[0257] To aid the interpretation, multiple lines have been added to the graph. The broken horizontal 
lines Eire drawn at two common significance levels, 0.05 and 0.01 . The vertical broken lines are drawn 
every 20kb to assist in the interpretation of distances between SNPs. Two other lines are drawn to 
expose linear trends in the association of SNPs to the disease. The light gray line (or generally bottom- 
most curve) is a nonlinear smoother through the data points on the graph using a local polynomial 
regression method (W.S. Cleveland, E. Grosse and W.M. Shyu (1992) Local regression models. Chapter 
8 of Statistical Models in S eds J.M. Chambers and T.J. Hastie, Wadsworth & Brooks/Cole.). The black 
line provides a local test for excess statistical significance to identify regions of association. This was 
created by use of a lOkb sliding window with Ikb step sizes. Within each window, a chi-square 
goodness of fit test was applied to compare the proportion of SNPs that were significant at a test wise 
level of 0.01, to the proportion that would be expected by chance alone (0.05 for the methods used here). 
Resulting p-values that were less than 10'* were truncated at that value. 

[0258] Finally, the exons and introns of the genes in the covered region are plotted below each 
graph at the appropriate chromosomal positions. The gene boundary is indicated by the broken 
horizontal line. The exon positions are shown as thick, unbroken bars. An arrow is place at the 3' end of 
each gene to show the direction of transcription. 

Example 8 
In Vitro Production of Target Polypeptides 

[0259] cDNA is cloned into a pIVEX 2.3-MCS vector (Roche Biochem) using a directional cloning 
method. A cDNA insert is prepared using PGR with forward and reverse primers having 5' restriction 
site tags (in frame) and 5-6 additional nucleotides in addition to 3' gene-specific portions, the latter of 
which is typically about twenty to about twenty-five base pairs in length. A Sal I restriction site is 
introduced by the forward primer and a Sma I restriction site is introduced by the reverse primer. The 
ends of PGR products are cut with the corresponding restriction enzymes {i.e., Sal I and Sma I) and the 
products are gel-purified. The pIVEX 2.3-MCS vector is linearized using the same restriction enzymes, 
and the fragment with the correct sized firagment is isolated by gel-purification. Purified PGR product is 
ligated into the linearized pIVEX 2.3-MCS vector and E. coli cells transformed for plasmid 
amplification. The newly constructed expression vector is verified by restriction mapping and used for 
protein production. 

[0260] E. coli lysate is reconstituted with 0.25 ml of Reconstitution Buffer, the Reaction Mix is 
reconstituted with 0.8 ml of Reconstitution Buffer; the Feeding Mix is reconstituted with 10.5 ml of 
Reconstitution Buffer; and the Energy Mix is reconstituted with 0.6 ml of Reconstitution Buffer. 0.5 ml 
of the Energy Mix was added to the Feeding Mix to obtain the Feeding Solution. 0.75 ml of Reaction 
Mix, 50 (il of Energy Mix, and 10 (xg of the template DNA is added to the E. coli lysate. 
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[0261] Using the reaction device (Roche Biochem), 1 ml of the Reaction Solution is loaded into the 
reaction compartment. The reaction device is tiimed upside-down and 10 ml of the Feeding Solution is 
loaded into the feeding compartment. All lids are closed and the reaction device is loaded into the 
RTS500 instrument. The instrument is run at 30°C for 24 hours with a stir bar speed of 150 rpm. The 
pIVEX 2.3 MCS vector includes a nucleotide sequence that encodes six consecutive histidine amino 
acids on the C-terminal end of the target polypeptide for the purpose of protein purification. Target 
polypeptide is purified by contacting the contents of reaction device with resin modified with Ni^* ions. 
Target polypeptide is eluted from the resin with a solution containing free Ni^"^ ions. 

Example 9 
Cellular Produgtipn of Target Polypeptide? 

[0262] Nucleic acids are cloned into DNA plasmids having phage recombination cites and target 
polypeptides are expressed therefrom in a variety of host cells. Alpha phage genomic DNA contains 
short sequences known as attP sites, and E. colt genomic DNA contains unique, short sequences known 
as attB sites. These regions share homology, allowing for integration of phage DNA into E. coli via 
directional, site-specific recombination using the phage protein Int and the E. coli protein IHF. 
Integration produces two new att sites, L and R, which flank the inserted prophage DNA. Phage excision 
from E. coli genomic DNA can elso be accomplished using these two proteins with the addition of a 
second phage protein, Xis. DNA vectors have been produced where the integration/excision process is 
modified to allow for the directional integration or excision of a target DNA fragment into a backbone 
vector in a rapid in vitro reaction (Gateway™ Technology (hivitrogen. Inc.)). 

[0263] A first step is to transfer the nucleic acid insert into a shuttle vector that contains attL sites 
surrounding the negative selection gene, ccdB (e.g. pENTER vector, Invitrogen, Inc.). This transfer 
process is accomplished by digesting the nucleic acid from a DNA vector used for sequencing, and to 
ligate it into the multicloning site of the shuttle vector, which will place it between the two attL sites 
while removing the negative selection gene ccdB. A second method is to amplify the nucleic acid by the 
polymerase chain reaction (PCR) with primers containing attB sites. The amplified fragment then is 
integrated into the shuttle vector using Int and IHF. A third method is to utilize a topoisomerase- 
mediated process, in which the nucleic acid is amplified via PCR using gene-specific primers with the 5' 
upstream primer containing an additional CACC sequence (e.g., TOPO® expression kit (Invifrogen, 
Inc.)). In conjunction with Topoisomerase I, the PCR amplified fragment can be cloned into the shuttle 
vector via the attL sites in the correct orientation. 

[0264] Once the nucleic acid is transferred into the shuttle vector, it can be cloned into an expression 
vector having attR sites. Several vectors containing attR sites for expression of target polypeptide as a 
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native polypeptide, N-fusion polypeptide, and C-fusion polypeptides are commercially available (e.g., 
pDEST (Invitrogen, Inc.)), and any vector can be converted into an expression vector for receiving a 
nucleic acid from the shuttle vector by introducing an insert having an attR site flanked by an antibiotic 
resistant gene for selection using the standard methods described above. Transfer of the nucleic acid 
from the shuttle vector is accomplished by directional recombination using Int, IHF, and Xis (LR 
clonase). Then the desired sequence can be transferred to an expression vector by carrying out a 
one hour incubation at room temperature with hit, HF, and Xis, a ten minute incubation at 37°C with 
proteinase K, transforming bacteria and allowing expression for one hour, and then plating on selective 
media. Generally, 90% cloning efficiency is achieved by this method. Examples of expression vectors 
are pDEST 14 bacterial expression vector with att7 promoter, pDEST 15 bacterial expression vector with 
a T7 promoter and a N-terminal GST tag, pDEST 1 7 bacterial vector with a T7 promoter and a N- 
tenninal polyhistidine affinity tag, and pDEST 12.2 mammalian expression vector with a CMV promoter 
and neo resistance gene. These expression vectors or others like them are transformed or transfected into 
cells for expression of the target polypeptide or polypeptide variants. These expression vectors are often 
transfected, for example, into murine-transformed a adipocyte cell line 3T3-L1, (ATCC), human 
embryonic kidney cell line 293, and rat cardiomyocyte cell line H9C2. 

[0265] Modifications may be made to the foregoing without departing from the basic aspects of the 
invention. Although the invention has been described in substantial detail with reference to one or more 
specific embodiments, those of skill in the art will recognize that changes may be made to the 
embodiments specifically disclosed in this application, yet these modifications and improvements are 
within the scope and spirit of the invention, as set forth in the claims which follow. All publications or 
patent documents cited in this specification are incorporated herein by reference as if each such 
publication or document was specifically and individually indicated to be incorporated herein by 
reference. 

[0266] Citation of the above publications or documents is not intended as an admission that any of 
the foregoing is pertinent prior art, nor does it constitute any admission as to the contents or date of these 
publications or documents. U.S. patents and other publications referenced herein are hereby incorporated 
by reference. 

Nucleotide and Amino Acid Sequence Embodiments 

[0267] Table A includes information pertaining to the incident polymorphic variant associated with 
osteoarthritis identified herein. Public information pertaining to the polymorphism and the genomic 
sequence that includes the polymorphism are indicated. The genomic sequences identified in Table A 
may be accessed at the http address www.ncbi.nih.gov/entrez/query.fcgi, for exjimple, by using the 
pubHcly available SNP reference number (e.g., rs756519). The chromosome position refers to the 
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position of the SNP within NCBI's Genome Build 34, which may be accessed at the following http 
address: www.ncbi.nlm.nih.gov/mapview/map_search.cgi?chi=hum_chr.inf&query=. The "Contig 
Position" provided in Table A corresponds to a nucleotide position set forth in the contig sequence (see 
"Contig Accession No."), and designates the polymorphic site corresponding to the SNP reference 
number. The sequence containing the polymorphisms also may be referenced by the "Nucleotide 
Accession No." set forth in Table A. The "Sequence Identification" corresponds to cDNA sequence that 
encodes associated target polypeptides {e.g., ELP3). The position of the SNP within the cDNA sequence 
is provided in the "Sequence Position" column of Table A. If the SNP falls within an exon, the 
corresponding amino acid position (and amino acid change, if applicable) is provided as well. Also, the 
allelic variation at the polymorphic site and the allelic variant identified as associated with osteoarthritis 
is specified in Table A. All nucleotide and polypeptide sequences referenced and accessed by the 
parameters set forth in Table A are incorporated herein by reference. 



Table A 



RSJD 


Chrom- 
osome 


Chrom 
Position 


Contig Accession 
No. [1] 


Contig 
Position 


Nucleotide 
Accession 
No. [2] 


Sequence 
Position 


Amino 
Acid 
Posi- 




Locus 
ID 


API 


Allelic 
Variability 


OA 
Assoc. 
Allele 


rs756519 


6 


170707371 


Hs6_7740_34:11 


520890 


NM_002793 


inlron 




PSMB1 


5689 


F 


[cn] 


T 


rsl 042327 


6 


does not map 


NM_003194 


exonic 




TBP 


6908 


R 


[T/C] 


C 


rs8770 


6 


170743040 


Hs6_7740_34:11 


556559 


NM_002598 


mma-utr 




PDCD2 


5134 


R 


[C/T] 


C 


rs1 563055 


8 


27976377 


Hs8_23822_34:16 


6328752 


NM_018091 


intron 




ELP3 


55140 


F 


[C/T] 


T 


rs912428 


13 


44965904 


Hs13_24680_34:1 


28147904 


NM_015116 


intron 




CHDC1 


23143 


R 


[T/Cl 


T 


rs1 888475 


21 


38832863 


Hs21_11669_34:9 


25572863 


NM_004449 


intron 




ERG 


2078 


F 


[A/G] 


A 



rn Contig Accession Nu mber which can be found in the NCBI Database: 
ttp address: www.ncbi.nih.gov/entrez/query.fcgi 

121 Sequence Identification or Nucleotide Accession Number which can be found in the NCBI 

Database: 

http address: www.ncbi.nih.gov/entrez/query.fcgi 

f31 "A" column is the sequence orientation ("F" is forward. "R" is reverse'). 

[0268] Following are genomic nucleotide sequences for a chrom 6 region (SEQ ID NO: 1), an ELP3 
region (SEQ ID NO: 2), a CHDCl region (SEQ ID NO: 3) and an ERG region (SEQ ID NO: 4). The 
following nucleotide representations are used throughout: "A" or "a" is adenosine, adenine, or adenylic 
acid; "C" or "c" is cytidine, c5i;osine, or cytidylic acid; "G" or "g" is guanosine, guanine, or guanylic acid; 
"T" or "t" is thymidine, thymine, or thymidylic acid; and "I" or "i" is inosine, hypoxanthine, or inosinic 
acid. Exons are indicated in italicized lower case type, introns are depicted in normal text lower case 
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type, and polymorphic sites are depicted in bold upper case type. SNPs are designated by the following 
convention: "R" represents A or G, "M" represents A or C; "W" represents A or T; "Y" represents C or 
T; "S" represents C or G; "K" represents G or T; "V" represents A, C or G; "H" represents A, C, or T; 
"D" represents A, G, or T; "B" represents C, G, or T; and "N" represents A, G, C, or T. 



chrom 6 genomic sequence fSEO ID NO: n 

>6: 170689051-17077 9900 









aaatt^a at 








61 


ttqqaatata 








ttccttttct 


^^acaattct 


121 


taqaqataaa 


atctactqca 




cacttaactt 


ct ccca aa 
'^t^acaccRa 


act'^^t t'^ 


181 


cqtqqttaac 


gugcggLggc 


cacatcaact 






^tca^catcc 
g cagca cc 


241 


taqqtqcaqc 




tcactctcat 


cacattccac 


cattccca ^ 
ca cccagg 


ccacagaggc 


301 


aaqagqatac 


tctcaaaact 




ttta tttta 


ggaaaa agg 


acggaaaaa 


361 


atcttatqcc 


tcaqaataat 


aqqtqqaatt 


caatt ctct 






421 


tCTtccaaata 


qqcttacaaa 


qqctttccct 


a att t 


tt t t 


t 

g ggcaggc 


481 


attaaqqqaa 


qttcaqqata 


qccaqqctat 


tttqaqactq 


atatctt 

gga 




541 


tctctcctac 


aaaccactcc 


cttggcatca 


ctctcaaaaa 




c t t 
g c g gcag 


601 


cccacaccct 


g ggg 


ggcatttttc 






gcca caggg 


661 


atctaaaccc 


taactcta^a 


ggaacaggcc 


aggcagc gg 


agcg acaag 


ag caccac 




acaaataact 




g c aaaa 


gggaacc gg 




cc ccccaga 


781 


aaacacctca 


CTtcacattt 


cttt^caa^a 






oca eg ac 


841 


acacaaaaac 


ataaaaaaca 


cccaaaata^ 


cttca ct ^ 
c cage gg 


^caggcagag 

ggy 


g ggc caca 


901 


actatcacta 


aaacacact 






gcag aggaa 




961 


aaqataacta 


aqactaata"^ 
taaccatca^ 


caataca ta 


ttactattat 




aa^t^t'^t^^ 


1021 


atcaaqqtat 




gagcagagaa 


aa a a t t 


taa tata ^ 


^^ctta^ct^ 


1081 


taaaaaqaat 


cagaagaaaa 




accctca tt 


aaaa qcta 


ata tcta a 


1141 


aaatttgtta 


ggtatttata 


ttattaacac 


aqaacao t 


aacac ata^ 


t 


1201 


tgttacaaac 




aacaacaaaa 


taqacaacaa 


ccctagaaqa 


aaaatqcctt 


1261 


gaaggaaaag 


ctqgaatact 


qtgaqaqact 


taccagtggc 


ctqtctgcqt 


gtaactaact 


1321 


cctcatcccg 


acttqqtcac 


qcaaaqqaca 


ggtgaccata 


cctccaqqaa 


gg cggaaagg 


1381 


ggccctacat 


gagtgagagg 


ctcaagtcct 




aatctataat 




1441 


aaagaaagtt 


tgtggctatt 


tttagaaact 


aaagtttatc 


tcatattgac 


acaaactcaa 


1501 










atttctcctc 


tagtacatta 


1561 


tcatttaacc 


aatacaaaca 


ggtctgggcc 


aggtgctgtg 


gttcatgcct 


ataaacccag 


1621 


cactttgaga 


ggctgagggg 


ggtggatccc 


tagagcccag 


gagtttcagg 


ccagcctggg 


1681 


caacatggtg 


aaactctgtc 


tctaccaaaa 


gtacaaaaat 


tagccaggta 


tagtggcaca 


1741 


cacctgtagt 


tccagctact 


aaggaggcta 


aggtgagagg 


accactgagc 


ccagggacgt 


1801 


agagactacg 


gtgagacatg 


attataccac 


tgcactccag 


cctgggcaac 


agagggagac 


1861 


cttctctcaa 


aaagaaaaga 


aaagaaaaag 


aaacaggtct 


aatttgttca 


tctaagcaat 


1921 


gataagattt 


atatgaacat 


aagttgcttt 


attgatgaaa 


aattgaacat 


aatctaatca 


1981 


agcctctaaa 


tttaactgcc 




aaaagacagg 


acagaaccta 


caggaatgca 


2041 


atcatttata 


tctagaatgt 


ggaagattct 


gcatgacaaa 


cgacttggat 




2101 


gtaaatttca 


aggaaagaga 


gagagagaga 


gataaaaagg 




gatttgaatg 


2161 


acatgttaat 




gagacaatca 


aggaaatttg 






2221 


tgttgaggga 




tttttaaagg 


tacaattgtg 


atactgtgat 


tatattttta 


2281 


aatgcgatca 


ttatctttta 


ggagcactaa 


aatatttact 


aataaaatta 


taggatttac 


2341 


ttcaaaataa 


acaacgataa 


taagattacc 


atgaatagtg 


gtgggtgaaa 


tatacaaggg 


2401 


gcttatttga 


catttccata 


ataaaaaaca 


cgaatgaata 


acaaagcatt 


atagaaattc 


2461 


aagtaaaaaa 


tagctcgtgc 


ttagtaataa 


atcaaagcgt 


gctggacact 


ttaaataata 


2521 


aacaaagata 


agatcttgcc 


ctctagaaaa 


cagctctaat 


tcaggacaga 


cttgctcaca 


2581 


tcagatggaa 


gagtcagaat 


gagatgtgct 


gagtagaagg 


tagaacggtc 


gctagcagag 


2641 


ggtgggaagg 


tggtatgtgt 


gggggagagg 


agaaagagag 


gttgattaat 


gggtacaaac 


2701 


agacagttag 


atggaaggag 


tgagatctgt 


tgttcgacag 


tagagtaggg 


tgactacagt 


2761 


taacaacaat 


atttgcatat 


ttcaaaatag 


ctagaagaca 


ggacttggaa 


tgttcccaac 


2821 


acatagaaag 


gtgatgaact 


caaggtgctg 


aacaccccaa 


ataccctgac 


ttgatcatta 


2881 


tgcaataaca 


aaatatcaca 


tgtaccctgt 


aaatatgtac 


aaatactaca 


tatcaattta 


2941 


aaaattttca 


cacaagaatg 


agatgtgctg 


atgcatgtgg 


aggccctagc 


tccagactgg 


3001 


gtgtaaactt 


caagccactg 


aagatcttat 


ttccaaggtt 


tttctacttt 


gaaattccaa 
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3061 


acttattttt 


ctagcagatt 


tataagggac 


acgggacaac 


ataaacttgt 


taaagtgaca 


3121 


agagaaagta 


aatatgccct 


taaatttata 


ccaaatcttc 


tgacaagtct 


tgactgataa 


3181 




caaatttgtg 




gagaaaacgt 




catttttaag 


3241 


tgtggacact 


tggcattgct 


cacggcttcc 


aacggaataa 


aatagggctt 


agttgttttc 


3301 


cattagcttt 


ttcctttgtg 


tgtgtgtgtg 


tgtgtgtgtg 


tgtgtgtgtg 


tgttgtgtgt 


3361 


acatattggc 


tctaagcatt 


tattagccca 


ataattttta 






3421 


atatcttttg 


ccgtattaga 


tttttgtaac 


atgtgctaaa 


ggttaaaaca 


ccttttcccc 


3481 


ttcatgaagc 


ctgagagaag 


tcggttttct 


gggttttCtC 






3541 


tatacgtctg 


tctaaaaagt 


ctcagat't'tt 


tcttgctaat 


tgtgcacctt 


cataatcaag 


3601 


cagacaaatc 


agaatattat. 


tttggtgagg 


ccatcatcta 


aactaacagt 


ctttatgtac 


3661 


agaagcagca 


ctgaccgggt 


tcatactcct 


cagttagcaa 


gtcaacatct 


tcctcctgcc 


3721 


agcaacccat 


cccagaatat 


ctgtggctta 


ttaatactta 






3781 


atttactaat 


taggagatga 


tcagatgtat 


ctattgatag 


caacaggcta 


tttaaaagtg 


3841 




tcaaacagat 




ctcaaagttt 


ccagttagat 


atttttcatt 


3901 


aaattgattg 


ctagattgca 


gccacaatca 


aacttaagta 




tttggttggt 


3961 






ttcaaggggt 




atagaattcc 


aaatgtataa 


4021 


agtcttgtcc 


taaaatggtc 


aataaaatga 


accagtccac 






4081 


agtccactac 






tctgcatttt 


catgttttca 


cacacctgca 


4141 


gtcattgtgt 


acaattctgg 


actcttgatt 




attctgcgtt 




4201 


catgttgctg 


ccaggtcttc 


atggccacca 


cttttaatqq 


ccagaaaata 


tttcatgtag 


4261 


tgaacatttc 


ataaattact 




ccaattattg 






4321 


ctaaaacacc 


actcttgtga 


caaagttctg 


atgtatccag 


atgtactcat 


accaaatcat 


4381 








gcccttttct 




aa aaac t 


4441 


tcacaacaag 


aagcttaagg 


cctctacaaa 




aattaaaatt 


tcaatttcac 


4501 


tctgaggcac 


atacaggaac 


cactaggtta 


tttggcttag 


aatggaggga 


agtgtcattt 


4561 


tgtttctgtt 


cggcctgcag 


gaagctcctt 


cccaggccct 


gcattgccaa 


ttgaacaatt 


4621 


gaacaattct 


cattgttcaa 


ttcccaccta 


tgagtgagaa 


catgcattgt 


ttggtttcct 


4681 


gtccttgcga 


tagtttactg 


agaatgatgg 


tttccagctt 


catccatgtc 


cctacgaagg 


4741 






atggctgcat 


agtattccat 


ggtgtatatg 


tgccacattt 


4801 


tcttaatcca 


gtctatcact 


gatggacatt 


tgggttggtt 


ccaagtcttt 


gctattgtga 


4861 


atagtaccgc 


aataaacata 


cgtgtgcatg 


tgtctttata 






4921 




tactgttgaa 


aaaatgaata 


aattgagcac 


actaaqtqtc 


tqaataaaat 


4981 


accctgacca 




taaacaacat 




atttcaaLLt 


ctcqqaaaaq 


5041 




gtgtccaatg 


ctcttgttat 


gcagtaatgg 


catctttqat 


tattcatatt 


5101 


cgttagagct 


tccagaaagg 


agtattgcaa 




cctctqactc 


tcatqaccaa 


5161 


atccccctgt 


cactgtcctt 


gttctctgat 


ccttcctctg 


qqccctcqqa 


aatgctggtc 


5221 


tccatcaaat 




agttttcaga 


aaaagttctc 


tttgggaagt 


ttcaagagaa 


5281 


aaccacaaat 


ttcctggaaa 


tgcttcatgc 


ttcatgacat 


ttaaggcttt 




5341 


aacttcagga 


gcaagatagc 


tggtaggtct 


tctggaggtc 


ttgcctaacc 


tgagaaggtc 


5401 


ccagagaati't 


tgtgcataga 


acctcccagg 








5461 


aggatgagaa 


aggtaagagc 


ttiaccaatgt 


ctccagtttt 


gtaaatgaat 


attcttaaca 


5521 


tttttggtga 


ggagaaaaaa 


atatcaaacc 


catcacagac 




tctcttoacc 


5581 










caaataaaaa 


atqtcaatat 


5641 


aaggctagac 


acaggggctic 


atgcctgtaa 


ttccaacact 


ttggaaggcc 


aaggcagacc 


5701 


tagaactttg 


taaatctgag 


attttgctcc 


atgactttgt 


ggatactttc 


tactaatacc 


5761 


acctacacaa 




caagaatctt 


aattgcattt 


atcaacttgt 




5821 


cataacattt 


agattcattt 


gaaagaaatt 




aaagaaataa 


gaagaacttg 


5881 




acaaagaaaa 


cccagaaaga 


gagagagaga 


gagaagcaga 


aaaactaaac 


5941 


actgaaaccc 


agagagagag 


agagaagtag 




cactgagtct 


taatctgttc 


6001 


tgccactagt 


cagcagcctg 


cacagagcac 


tttaatgtat 


tctctgtcca 




6061 


tgtttatgaa 


agtatatagc 








ctctccaaac 


6121 


cacaaccagt 




cttctctcct 


cctcctcctc 


^^^^^^^^^^ 


tttctctctc 


6181 




tctctctctc 


tctctctctc 


tccccctttc 


cctctccccc 


acccacccac 


6241 


ctcccaaccc 




ctaggacacc 


tccagcatag 


gttactacca 


attttgcaca 


6301 


cctccagcaY 


aggttactac 


caattttgct 


gggtttatct 


ccttctgtcc 


tttccgcttt 


6361 


gatctgagga 


atagctgaga 


tttaggacag 


caacaaggtg 


tacctccttc 


caggttataa 


6421 


aacaggatta 


atgattaggc 


tcaaggcccc 


ttcctagtca 


ctcagtaaag 


tctgtgcact 


6481 


ggaaaactgt 


ggtagcagtt 


ttctgagcat 


tagaaaactg 


tggttctcac 


agaggctgga 


6541 


tgagtcaaca 


ctgccatctg 


gcggcctctt 


ggagggtgat 


gtggagcctg 


gctttcattg 


6601 


aagaatgaag 


gtcccttgat 


ttcctgaccc 


tcagccaact 


ctccagcagc 


ttctcactgc 


6661 


agaagaggtc 


aggccattgg 


tcagcttgag 


gacaaagtgg 


gaggatcaca 


ctttcgatca 


6721 


cctatacttt 


ctaacaatca 


gccctgtgga 


catctgctca 


cgccccatgc 


tgtttttaaa 


6781 


atattttccc 


attatagaaa 


tatttttcaa 


agatgattca 


caagctccag 


gagccattca 


6841 


gacaagggag 


agcaaattgg 


cagctaaact 


cattcaagag 


tggagcagat 


gcacatgaag 
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6901 ctctgtctgg cggaggaggc acaagacacc gagcctggct gggagggtgc tcatgacaag 

6961 agtggggcca caggcctctc ctttcattgg acatagtggc catacaaagt ggctgccttg 

7021 aaatgcaccc acatacacta tcggtcctcc cggtctaggg agaaaactS^ cacagtatgt 

7081 caacagataa atcagcgcaa tcctgtgggt gaagcagtgc acccatactc ttcattctgc 

7141 tgagcggaac tcaagggtga gacacttgta ttcctaactc cacttgcctt caggctcctc 

7201 caacattcca aattgcccat ggacagagct actacccctc catagaagtg acaacttgaa 

7261 ataaaagatt tgaagctcct tcccacgttt agacccaggc ctgctgctag gaactccaga 

7321 ggagggatgg aaagaagttt gcactgctca cagatttaat gtttctctca caaccagaag 

7381 taggcagcag gatggatttt caatcaacac taacaaacgg atcactcctg ggtcctttaa 

7441 acaagttcca tgtctcttta taggttttag gtgcctccta tgtgttcaga cttcgtataa 

7501 tggatatata agaaaagtaa atgggagggg caatatttaa gaaataacag caaacaattg 

7561 cattcatatt taatataaac tacaagtatc aagttaatag acgctcagaa gaaattcata 

7621 aaatctagaa aagttgggaa aatctcaata gagaaaataa aacttgatct ggactttaag 

7681 agtatgattt ataaagttca aggggacagt acacaaagtc agacaatgtt agctgaatca 

7741 ctgaagaatg agaatatcac acccacttaa caaagcagcc cagctagtta aagaagtatg 

7801 ttgaagaggt agggagttgg gtgggggagg cgaggatggc gtggagggat ctgaaaacta 

7861 tcaaccttaa ataatgagat ttagaaaata tgattaagta gagggttaat tcgagtgcag 

7921 agcttgagaa tggccacctg gaaacactga ctctaaacca gtaaggttaa tgtttcaaag 

7981 tggagaagtt aaggtttcac ttagaaattt tagcaggatc acattttcca tacaagacca 

8041 gtgcattcgc cacagcaatt tgattggtta tagattgctg ctcattccaa gattacttta 

8101 ttactctgtg cggaggagta gtgatttgag gggtcttatg tctggtgcct ttttgtcttg 

8161 tttacagggg aaaaggcaga agttgcgcct gcatgccgca taactcaggc tctgcatagc 

8221 cacatgtctt tcaaggctca gaataatttg aagttccaac agctttaagt ttgaattaat 

8281 ttcacaaagc ggagaaagac ttggacttcg tgcagtaatg aagaccaccg atgggtactg 

8341 aacagctctg gtgggtttgc actcagagaa aggagcctaa gatgcagaag gtgcagtgag 

8401 gcagcagcca tctgctcttg tgctgctgag cagagcatga tgggctgtca ccgctcacgc 

84 61 gtgttcttta cctgctgact cacctggcgg catgggctgc atatggggtc ccagctcctt 

8521 aagctcctgt caggcccttc tgcaacttct cccaagctct tgggccaggt gcatgtctag 

8581 ccatgaaaaa ggaggccagt acctgatcac taagtgaaag ttctaaggta gtgggactgc 

8641 cacaggtgtc ccccatggtc ccaggtcaca atccagtctg ttgaccctcc tccttttgca 

8701 ccatcgcccc tttgacagcc tgtgctatgg gttttaggct cctagcacca aacagaaaca 

8761 ggcttatata gatcatctaa tcagctcttc ttatgtgagg ccgaactctg taataaatct 

8821 ttttatgtct cctagggctt ctctgattga acctgtctga tgaggagtta aagaagttct 

8881 caaattgtgt atgcataaga atcacctggg atttctagct cagagactgg gacaataggt 

8941 ctattgaacc taggagttca ttttgaacaa gtgctctacg taattctgat acagggaatc 

9001 ttccagttac agtttgaaat tcacaaatac aaggaatgag agacctagaa tcagaaagca 

9061 tgttaataca cttttgggcc attaagtgct cacccaggta acaggtacag aaaggcagaa 

9121 aaaggaaacc tatcagggta ataattatgc cttttttttt tttttttttt gagacagggt 

9181 cttgctctat cacccaggct ggagtgcagt ggcacaacca cagctcactg cagccttgac 

9241 ctcctgggct caagtgattc ttccagttca gcctcttgag taactacgac tacaggcatg 

9301 tgccaccatc cctggctcat tttttgtaga gagggggttt ggccatgctg cacaggctgg 

9361 tctcaaattc ctgggctcat gtgatttccc cacctcagcc tcctaaagta ttaggattac 

9421 aggcgtaagc cactgtgctt ggccaagaca ttttggtaag aaatattatt ttcctactaa 

9481 attgtctaca ttccccttgg gtaggcttgc aaagtcactg tgactacagc aggagctatt 

9541 gctgcatggg aaatatggag acacgagtgg tacctggcag tcacgggctc agtttgtttc 

9601 taacctccca agtcagcaca gccccactga gcagactgcc ggaaagtatt tatgccatct 

9661 gtcggataat taagacaaat ccaaacatct acgtgcattc tgtgtgtata aatggagtca 

9721 tggccaacct ctcaagcagt tttccatcaa tcacttgtaa tattaccaga tacttccaat 

9781 ccccttgcag gcagtaatga agagaaagtc tctacatcag cagcttctca ctggaatctt 

9841 caagcctaac tcttaagaat gaatcatact tctaatgttt atcaaaggtt ttcttttcac 

9901 acaaggaatg tgcctgaatg atctataaaa ctacccaaat aactagtagc caggttagct 

9961 ctaggagcca tatgccaaac acattctgga ctaagtcgtg ttcactccaa aactaagacc 

10021 atgttcaaag tctacagcaa tactggatta aaacaatgtt agctgaaggt tgtaaaaaaa 

10081 catcatggaa cactaattat gtgccatcaa tcattactca atttgtagaa cttccctata 

10141 caccttggga ggtgagaacc ttatagaaaa tcctagatac acaactttat ataaacagaa 

10201 aattccaaaa tgggaaatcc tttagaataa gaatatacct agtgtatata aagaagaaaa 

102 61 aatgtagatt tgaaattggg atactctaca gtttttagtt gggacattgg gatagttaat 

10321 tttaggtgtc aacttgactg gattaaagga tatcaagaca gcaggtaaag cattacttct 

10381 gggtttgtct gtgaggatgt ttccggagac tggcctagga gtttgtggac tgagtgcaga 

10441 tcagcctgtg tgagcaggca ccatccaact gaatggagcc cagatagaac caaaaggcaa 

10501 aggcaaataa ttcactgtct ttctcctcga gccagggact tgattagcaa cccctcacac 

10561 ccaactcctt attttcagga cttcaagggt tataccatca gcttccctgg ttgtgaggcc 

10621 ttcagacttg gactgagcca tgctgccagt ttctctggct ccccagcttg cagacagcca 

10681 ggcatgggaa ttctcagtct caataactga gtgagccaat ttctttaata aaccctttct 
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10741 catctctctc tatatacaaa catatcctat tggttctgag tctttggaga atcctcatac 

10801 agacactgtt aagagatgaa ccaggtgtta ccactgtgtc cccaatgcct gggatagtga 

10861 gtgacagtat tgatgtttgt gcaataaatg tttacaatgt gtatctacta gcagctttcg 

10921 ataatttgaa ttatttataa aaggcaaaat aaacaaaagt ggtatctaag attaaccact 

10981 aacttagtaa aattccattt ataccacagg ttgttctggt acaaagcata caataactcc 

11041 catgttacat ataatttgac ttgttcaata acaagaaaat tactcaaata ttataatgga 

11101 gttaaaatgt atttaataga tctaaaaaat tttggctcta tcaaaactta ggtctcaatt 

11161 ataacattac taaagtgctt gtaaaaatta tcccagattt ctggctacag cttgtataaa 

11221 ggcctatttt ttttttttta cagttttatt ttggttgaag atttttctcc aaaatagcta 

11281 ttccagatgg cttaacagtc ccagaagtga aaaatcttaa gatatttcat ttataaccat 

11341 tagagtctta ataaaaccct agtaaatact ctcccttctg gatggtttaa aggtcccttc 

11401 aggcaagctg gctgcgtaaa caccagagcc ctcttcataa gataagtttg ctcaaaaggt 

11461 caacttttac agagaaattc ctacctcatt agtgtaaggg aaaattaaca tgacctgttt 

11521 tcattactta aaatgcaaaa aaaaaaaaaa aaaaaaaaaa aaaatctcag aaaaatagaa 

11581 aagggtggga aaaatgaaga aaattaaaag aaattctact tcctatatct gtcctgctta 

11641 gagaagacaa gttatagcaa aatgagtact tcaggtttct cttttaataa acaaaaccat 

11701 ccaaggtaca gttccaaagt acaaaatcaa ccaggtctga actgattggt gataagagca 

11761 cacagatcag tccttcctta aggaaacagt ttcctccctg atgccctctt tggtcactat 

11821 gcagatccgg agtgcgtccM cagtgtacac atctctctca gccgcagaaW tgaagacatc 

11881 tttcaccagc cgcatggctc tgtccaagga cagcggaaca tgctccacat tctgcatgtt 

11941 cttaaaacca acctggtggg acatgaaact tgggtgagat gtcatgtgac agtgagcaca 

12001 aaggaaaggt tcaagtttct tccaataccc tcatgtgtac gaggatttgt gatgaaaata 

12061 ctaYacaaaa ttaaaatgtc cttcagactg aataaagacc cttccagacc tggatgaatt 

12121 acacacatca gaaatctgac aacgattccc tggcaaagct cccactcttg agaagcacat 

12181 gacccggaga ggtggacgta tgcaagcaaa gtgatcagag aattgctata actaagaaga 

12241 aaataagagc caggagagcc ccaacagatg tccactaagt ctcctagaag aagacaggct 

12301 tcagcagaaa ggacttttaa gtaggtgagc gctgaaggct acactgaaga gcacagaaca 

12361 ttccagactg aagatgtggg acggagttag gaaaagcaca tcttttctga gaaaagcaga 

12421 gttagggtct gggtgtagtg gggtgacagg agatgaggtc aataagcaat aagcagagga 

12481 gtagctatat atgacatcct tcagaaaact gagcaggagg aattagggaa gtaaacttgt 

12541 aggcagaaag gatgttagga aataaatcca ataatctaat aatgtaggtc tgaatttagc 

12601 cagcagtaat cacaaagagg gacgaattag aaatacaggc ggcctttgaa taacaccgtt 

12661 ttcaaccact caggcccact gacacataaa ttttcagtaa acacactgag aagtttttgg 

12721 agactcgcaa cactttgaaa aactcagatg agccaagtag cctaaaaata ccttaaaaat 

12781 taagagaaag gtatgccaca aatgtataaa ctatatgtag atactagcta ttttatcatt 

12841 tactaccata aaatacacat aaactgatta taaaaagtta aagtgatctc tcaatgttct 

12901 cctttgtttt tcattgtgtt tagtgcgatt attataaacc ttaaataaca agtggcccaa 

12961 acaaagcact attagtgatg ctggaactgc tccccaagaa gcagggaaag ttatgacatc 

13021 acaagaaaaa gctgaattgc ttgatatgta ctgtagattg aggtctgcag ctgtggttgt 

13081 ccaccactcc agacagatga ttctccttgt aaacagatga tgtaaactta cagtatcaac 

13141 aaatacagta ttgtaaatgt attttctctt cctataattt tcttaataac attctctttc 

13201 ctgtagttta tttatcataa gaatacagca cataatacat atgtaaaata cgtgataatc 

13261 cacagtttac attattggca aggcttccag tcaacagtag gcttatttgt aagttttctt 

13321 ggagtcgaaa gttatatgtg gattttccag tgctcgggga actggcgccc caatccccgt 

13381 gttgttcaag gStcaactgt atgcttggga gttcaagtac agtattgggt aacacagaaa 

13441 gggggggtaa gagggaaaaa ggcattttta tgatcttcag gtaaagacag caagagaatt 

13501 agccactaca gacataaaag agttggaagc catgagaata caggaaaaca atcaaaacca 

13561 tgggagtagg taaggaccca caagagagta ggcagaacac agagagagac catgaggttt 

13621 ctgtaggaca cgtctgccaa actcaaacag taatttataa gaaagtcttt atatcttggc 

13681 tctcctaagc taatgaagag gagaacttgg aggaaggcac ttcaactgaa caccatgaca 

13741 agttggacat agtatcatta cctggacaca gtatcattac ctggttgtca agcaggggct 

13801 gtagcatggc acttgctgag cctccagcct tgaaggagtc tctctggtaa gaccctactg 

13861 gatcaaagct gtatacagcc ccctttcctt aaagaagaaa acagtattca taaggatggt 

13921 atccaatcac aatcccaaat catcgcaaaa ataaatcagg tcaaaacgtg acacaaaacg 

13981 gactatcacc ttggctaaaa cttcagggaa cagttggaac acagccactg ttgcactggt 

14041 gacagctgca gcacacacac tgttgcacat gtgacagcca gaacccagtc accattgcac 

14101 aggtgccacc cagcagacag ccccaatccc cgtgttggtg tgttgttcaa gggtaaactg 

14161 taggcttggg agttaaaata caatattgag taacacagaa agggtgggta agagggagaa 

14221 aggcattttt atgatcttca ggtaaagatg acaagagaat taaccactgc agatgtaaaa 

14281 cagttggaag ccatgagaat acacaaaaat aatccaagcc atggtagtag gtaaggaccc 

14341 gtaagagagt agggagaaca cagagagaga tggtgaggtt tctgtaggac acatcagcca 

14401 aactcaaaca gtcatttgta agaaagtctt tatttcttgg ctctcctaag ctaacgaaga 

14461 ggagaacttg aaggaaagca cgacttatgt atatagacag gtccaaggga cacatccgca 

14521 agacacagta aaacgaagag aattctgtaa cctgtaaaat acgatactgc ttttgtttaa 
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14581 aagaataaga aaactatttg tacaagcaca aaaaagtctg gaaagataag taactgtggc 

14641 tggtcatctt tgagctctga attagaggca agaaaagaag ggaattaatt catgtgcttc 

14701 tatcttattt gagtttgtta caaagaacac aggtgattct acaatcgtta aaaaggcctg 

14761 ttaaaaacaa ccgtgttctg ggtatgagaa atcacgagtc atttacaagg tatgattttt 

14821 taaaaacata actgatttag aaaacacatt atataccaga actcagtacc ctgggagatg 

14881 tgatataggg acaatctcct tactaaccat gtacaactgc cactgttagc tttgtatgtt 

14941 ttctactata gaactgcgta aacttggcca ggcgcggtgg ctcacgcctg taatcccagc 

15001 actctgggag gctgaggtgg gccgatcacg aggtcaggag ttcgagatca gcctggccaa 

15061 cacagtgaaa ccccatctct actaaaaata caaaaattag ctgagcatgg tggcacgtgc 

15121 ctctattccc aactacttgg gaggctgagg caggagaatc gcttgaaccc aggaggcgga 

15181 ggttgtggtg agccgagatc atgccactgc actccagcct gggcaacaga gcgagacttg 

15241 gtctcaaaaa aagaaaaagg attgtgtaaa ctgaattctc tggtggagaa ggcaccatct 

15301 gaggtcacct aagaaactgg gaagtatttt tgctggaagc aagcttggga caagacatga 

15361 gatcacagca ggcaatccac taacctcaat atacaaaaaa agaaagagat gtaaggtgta 

15421 aaggcagaaa aatgattgta tctctcatga ggatgacagg ccacaagtaa actgaggccc 

15481 aaaagcttta cccaaggatg gaaccaatct cagaaaacca cacacaacaa atttaaaaaa 

15541 ataaaccaaa gaaagggtgg gctatgtcca agaacacagc tttaaagacc ccctagaaga 

15601 agcaagagag acactagtaa aatcagtata aaacagcaaa aatcaataaa taaaaacaaa 

15661 aataaatgta gaatggcatg aatcttgatt acaaagtgct tttcctcatc aaagtaaaat 

15721 ccaaaaagca caatcatttc aatgtaaaaa ataaaatcag acaagtctta aaagaaatac 

15781 agctaagctt tggatgggag aagtttttct aacaaggcaa gaaactgaat gtcataaaga 

15841 aatgattgac agattagact gtatgaaaca cagcaatttc tgtgtagcaa aaggcaccac 

15901 ccaaaagaga caaataacac tcttgggagt atctataata tcacagacat ttgacacagt 

15961 cgaatgtagt tcttgaaaat cagtaagaag aaaactaacc tgcaaataag gaaatgggca 

16021 acagttaata tatgaatcag tctccaaaga ggaaatttat aaagtcaata aacatatgaa 

16081 aacatgccca agaatatata taaatgaaaa taataaactg ttcaataggc aaaaagatgt 

16141 ttcaaaatgt gttaattcta atactgggaa ggttatataa aatcgggtac tcagggtact 

16201 ggcgtatcct tcagaaagag tatactcaac aaaatacaaa aaaaaattta tctgacccaa 

16261 tacttcaact actcgaagtt tattctagca agacaagcat aattggggaa gtacacaaaa 

16321 atgtatgtac aaggatgttc atcatagtgt ttaYattaag aaaacaggaa gccaaaatgc 

16381 cctttagagg actggttagg taaatgatgc tatgtattac agcagcaaaa acagaagaca 

16441 aaaaaatttt ttgttaatta aatacaaaaa tcattatgca tatagcttag atctccttga 

16501 taatacataa tgtatagtta cagaaaaaca gataccaaag tcctacttac ataaaactRc 

16561 otgagtgtat gtataaacat qtactgattg ttttcactac agaaaaacta aagccacttt 

16621 taatttggga gaaaaatttt taaaaggcat gagagcagct aagagattaa agacagagag 

16681 gttagtgcag acagcggttg acaccaaggg aaagagctgt ggaccttgct gtccLgaagg 

16741 tttcaatatg gagcactgct ctccaggtcc ctgcatccac caacacagaa gacagtaaag 

16801 gaagtctcct gagacattag cagaggctct ggacttaggg gaccaggctc tagagtggca 

16861 aagggtagga gcaaaacaca tgaaatggaa agttgatgta aacaaaacac agaccttcag 

16921 cccccttcct cctcYgaggc ctaaaaagct gccaaccggg cttctcttcc tgtagtccca 

16981 cctcacctca ggaaaaaaga ttaaagaaat tcttcatctg aagaaaatgc tgcttactta 

17041 agcacccagt taacggctgt cagttgacaa gccatgtcaa caaggacttt cataatcaaa 

17101 ttttggtaca agactcttca atgtgaatgg acagcaaaaa aaaaaaaaaa aaaaaaaaaa 

17161 tcagagattt gaaggaagcc tctaacatag aagacagagg ccaaaaatga acacataaca 

17221 tgaaaaaaaa aacattctga tgaaatacag aaaactgcaa agaaagtaaa caggattttt 

17281 tttaaaaaga caaaaaataa gtacaagctc ttggaaatta atgacaccag aaataaaagc 

17341 aatcagtaaa ggatgaaaag ctaaaattaa ggaaatcttc ataaagtaga acaaaaaaga 

17401 cagacatgga taacatgggg agtcaaaaat aaagtaccgg gtcataatcc aggtggttgt 

17461 ggagaaggtg cagaggggta ggggaggagg agaggaaaga aggtggagaa agaggataaa 

17521 acaaataaga aactctaggc aaggaacatc actatacaat ttcagaacac cagggtaaag 

17581 aaaagatcct aaaagcaaag tctcaaaaag gtcccRtttt gtttttttca gaaatctttt 

17641 aggaggtaag cctctaaaat aaggaaataa atcaagaaag aggagaacac aggatgtggg 

17701 ggaatctcag gatggcagtg gtccatcctg gagcaaRggg aaaagggatc cacctagtat 

177 61 gtgaggtctg acaaccagat gaattgtact cagagaatgt acaaggaact gacaaaaagt 

17 821 ttaaagaaat ctgagccaat tagagatagt taacttaaga aaaaaataaa aacgagttac 

17881 acaagaaaga aaatattaat gcatgctact tggcgcaaca aagatcaata tctatctggt 

17 941 cataataaaa acattaaagg tctatgttac caaaagtgta atgtgttaaa ggaggatgct 

18001 aatgaggaag aggacacaaa agagctaaga accatcatct accagaacaa aacaccaaaa 

18061 gatatctcaa aagagtgcag tacaaataca gagttttgga aaatggaggt aagtaccagg 

18121 aaatacaggt aaaagagctg aaaactgtct tagggcaatg gaaaaaaaag atagaggact 

18181 gctttattat tcacacaaat tcctctataa aataaaaatt agtttaaaga acacttaaaa 

18241 gccaaccaga cagacaagct tagaaagaca gttctagaga cacctgaggt tgtcaccaaa 

18301 actagcaaca ctgttacatg Raaatcaaag aacagggctc tgaagtgaaa caaaagccaa 

18361 acctgaatct taggtctgct gcttagtagg tatcagcttt cccagattag actctctaac 
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18421 


tctcggaaaa 


actcaaagga 


tgctgtctgg 


ccYgctgaaa 


ttcaatggca 


gctcagctat 


18481 


ttttaatgtt 


gctaaaatct 


gaatctatta 




aaagccaggc 


ataatgtaag 


18541 


tcaaaaaaga 


caccagaagc 


atccagcttg 


ctaagttagg 


caaattaatt 


tgtagctaga 


18601 


ctctaattac 


ccatgttaaa 


ggtaattcat 


ttctattttt 


actatttcta 


cactttactt 


18661 


tttctcttgg 


ttttcaagta 


aggttgaaag 


ccttaaaaag 


aaattgtgtc 


aacctaattt 


18721 


gtctacgcct 




aggtaaaatg 




tccccaaggc 


tgtgtatggg 


18781 


ggcggatgtt 


aagagtcaga 


gatagcgcac 


Rcaaagaaaa 


cgctcatgga 


acagaaagcg 


18841 






ggcagggcgc 


tgaaatgaag 


cttgaccctt 


tcctcttaga 


18901 


attttccctc 


acaggattta 


gggtccccag 


tccccttgct 


ttctttctgt 


gtccaactga 


18961 


gatgttctgt 


gatgtggcca 


gccgcatgta 


ttcccctgga 


ggctctgacc 


caagccaagg 


19021 


cctgccttga 


acattcccag 


gcgctgacaa 


tgttgtttag 


gctattgctt 


gaaacactga 


19081 


gaaattaccc 


ctgttgctaa 


acacacagaa 


actatccctg 


gccctgaacc 


aaattccttc 


19141 


aacactcaaa 


tcaactccct 


caccttggac 


acatcttttc 


ctgctgtctg 


ctgagaggac 


19201 


tgctgcagca 


ctaaatgctt 


ttgactgatt 


accctgagat 


ttagtgttct 




19261 


ctaggttcgg 


ggcagtccct 


tacaggaatt 


cctctgcccc 


tacttttagg 


gcaacttcag 


19321 


ccacaggttc 


agccagaagg 


aacacagaag 


atgggcttac 






19381 


gacacttctg 




tctcaatgga 




tgacacagat 


ggactactta 


19441 


tccatctgtg 


tcaatggact 


actctaagtg 


acaaagatct 


gtcctgggac 


actgtacttg 


19501 




ttagatgcaa 


ccaaaagcca 


atccaaatga 






19561 


agaacaaatg 


gattggaaca 


tttggatgga 


aaacattggt 


gcccatgtgg 


cgtaatgcag 


19621 


ctctggaaca 


aaaatgggat 


ggaatattaa 


gtaacaagac 




cagggaatcc 


19681 


tggtctgact 


ctgacaaaat 


cagagaggtg 




ttctgaaaac 


actggagtag 


19741 


atgcttatta 


ttaattaccc 


caaatgactt 


ctagaaccgt 


taaagaactt 


ttttgaagtg 


19801 


tatttgttaa 


taaaaaaatt 


ttcaatactc 


agaatcacat 


ttaggtcaag 


ctttctaggt 


19861 


ccgatttctt 


tctataagat 


aaaaatgcct 


aa't'tatttac 


tactaactca 


tgcaagttcc 


19921 


tgagaaaata 


gcaaaataac 


tggctgctca 




ttagagcttc 


aatgtgcctt 


19981 


tggaggtcat 


acattagctt 


cagtttcttc 


atcactttcR 


ttatcagaaa 


tgaaatgaga 


20041 


attgtgtcag 


atggtattta 


tggttctttt 


caactctaga 


tggatacaca 


catctaggga 


20101 


ataataactc 


ctccccccca 




gaaagaagga 




cttcatcaag 


20161 


tccaccgatg 


atgttgtaaa 




aaagaagcgc 


cttgaataca 


ggattgtaga 


20221 


cagcattgca 


gcaattgccc 


ccgtagtcat 




ttggaatgct 




20281 


caattattga 


taaaagtcac 


aggcatgtag 


aggcagaggc 




tagaacaatc 


20341 








ccacagtgga 


aggggaaagc 




20401 


ccttcatgat 


caaccaacac 


agtattacta 




ttaagggaag 


gaatgccaag 


20461 


tttgccctca 


gctctctggt 


ttagactcta 


ttttccatac 


cacccttatc 


tcaggtactg 


20521 


ctaatccaag 


agctaatgtg 


tatgaaaggg 




ggcctaagga 


agatgggttt 


20581 


aaatccatct 


ctctccctca 




agaccagggg 






20641 


tttggctcag 


gggccaaatc 


cagcctactc 


cctattgcta 


tgaatattgt. 


ccgtggctgc 


20701 


tttttcacta 


cagaagcaga 


tcaaatgagg 


cccataaagc 


tgtaaatatt 




20761 


tcctttacag 


aaaaattacg 


gcaatccctg 


atcaaaccta 


gatgtaagtc 


cctctctttg 


20821 


tggagcagat 


ctctgtgaga 


acagaagaca 


cagggcagca 


cccttactct 


gcatgatttt 


'20881 


aatacacagc 


agtggtcgtg 


atcct'taaaa 


catic'tacag't 


agagaacaca 




20941 


acagggtctt 


agcatgacaa 


tacagtgact 


gcaaatccta 


gctttactat 


acctttgcga 


21001 


gccctacatt 


acccactggt 


agagaaactt 


cacaaaaatt 


ataaacataa 


aatgaattca 


21061 




tttttgttaa 


ccacatagca 


aacatctcta 


atgagaaact 


taatggccag 


21121 


gcatagtggc 


tcacacctgt 


aatcccaact 


gaggcgggag 


aatgtttgag 


cccaggagat 


21181 


gaaggctaca 


gtgagctaca 


atggcgccac 


tgtactccag 


cctgtgcaac 


aaagtgagac 


21241 


gctgtctcta 




cagatagata 


tctaactagc 






21301 


cagtcaaaac 


actgaaaaat 


gattagaaat 


gaactacaag 


taacaggtaa 


gttttttata 


21361 


gaggaatgcc 


tttccaagat 






cataggccgg 


gcacagtggc 


21421 


tcactgctgg 


cctgtaatcc 


cagcactttg 


ggaggccgag 


atgggtggat 


cacctgaagt 


21481 


caggggttcg 


agaccagcct 


ggccaacatg 


gtgaaacccc 


gtctctacta 




21541 


attagacagg 


ggtggtgaca 


ggtgcctgta 


atctcagcta 


ctcgggaggc 


tgtggcagga 


21601 




aacccaggag 


gcagaggttg 


cagtgagcca 


agatcatgcc 




21661 


aScctgggca 


acaagagtga 


aactctgtct 


caaaaaaaaa 


aaaaaaaatt 


tcattccatt 


21721 


tcatttcgaa 


accaagcacc 


aacatttatt 


tacttcacca 


cctaataaag 


tatgagagaa 


21781 


aaacttcagg 


aaggggtgtg 


tattatgtag 


aattcaccat 


tccttgtttt 


attaacaata 


21841 


tcatgtttac 


ttttgtcttc 


taactcccat 


ttataaaata 


agcattagga 


attatcatat 


21901 


aaatatgcca 


ataaatgtaa 


gtcactatgg 


ctttcaatgt 


aagtttctaa 


aaaagcatta 


21961 


acaattgata 


aatactgagt 


gaaatccttt 


ctacacattc 


aaaaagcatg 


tgtcagttaa 


22021 


ataaatatac 


tatatactcc 


agtaaccact 


cataagtgta 


aataatctca 


agtgttattc 


22081 


atgaataggc 


aaaacagaat 


gtgatcaccc 


ctgacttctg 


gcagctggct 


atcaactgaa 


22141 


gctggtcagg 


tgacagacgt 


taatggtgca 


tgttgtgcct 


cccatccaag 


ctcccagctt 


22201 


attcattctt 


tcgtatttct 


actcatgcat 


cctgccaact 


aagagcacag 


cactgtaaag 
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22261 


aggaaaagct 


tgtcaactga 


tggtgatgca 


aaccaagctc 


ctgtgtcaaa 


tctttcctgg 


22321 


gctctctcct 


gccacagaaa 


ttgatttgct 


ttgtttttga 


gggaagagta 


tcttcagaag 


22381 


aagaactcct 


cccaggctcc 


ttctaaaagg 


gtagagaacc 




tacaagactt 


22441 


taacaaatgg 


aatgtgctgc 


tgtttcacat 


tatgctactt 


taatgaccct 


gcattaatga 


22501 


tatcatttca 


tttctagcat 




ttcatctact 


aacctagtga 




22561 


atctgaccaa 


cagtcatgct 


gtaggcttaa 


aagaatttaa 


tgaattacta 


ttaatttaaa 


22621 


atagtgtaga 


aatgtacaaa 


agcatctacc 


tttagtcttg 


cttcaataat 


ctttgtcagc 


22681 


gtaagacagt 


ctccatgaaa 


accgctgcat 


ccaatgactg 


ttttgtctgt 


tctgtaaaaa 


22741 


gcacatttca 


gaaaactgag 


ctggttagat 


agtagagcaa 


aacatatctt 


cggcaatttt 


22801 


aaatttaaaa 


ttctgattcc 


cactattccc 


taaatggtaa 


tgggacaaac 


agtcaccctt 


22861 


ctggggagga 


ggggtctgta 


ttccatctta 


ccccttattc 


cagatgaatt 


acaggtggat 


22921 


caaagacttt 


aaaaatgcga 


ctatgaatgg 


actaaataaa 


tcatgggtga 


atttttttta 


22981 


ctatcttggg 


ctgaggaaga 


tacgtaggta 


ggttaattga 




agacggagtt 


23041 


tcgctcttgt 


cacccaggct 


ggaatgcagt 


ggcgcaatct 


tggctcactg 




23101 


ctcctgggtt 


caagcaattc 


ccctgcctca 


gcctcccaag 


tatctgggat 


tacaggcacc 


23161 


cgccaccaca 


cccagctgat 


ttttgtattt 






ccatgttggc 


23221 


caggctggtc 


tcaaactcct 


gacctcaggt 


aatccacctg 


ccttqgcctc 




23281 


ggattacagg 


tgtgaaccac 


tgcacccagc 


ctcgtgaagg 


tagatttaaa 


caacacaatc 


23341 


caagggccac 


acagaaaaat 




tgactacata 


aaattctaca 


gggaaattta 


23401 


agaatagctt 




taagaaaata 




atctaMcccc 


aaagtctaat 


23461 


tttaattcta 


taaaaattct 


tacaaataag 


gaaagcaata 


acataaagga 


aagatgggta 


23521 


acttctatga 


acagacaagt 


cacagaaaaa 


agagacatta 




aacataagag 


23581 


cctctatgtt 


actcagaaat 


ccaaatctca 


aaaagagact 


atttcaacca 


attatactgg 


23641 


ctaaaaatgc 


ctaactaact 


aaatgctggt 


gaggttgtgg 


aaacagtata 


aactgattaa 


23701 


cctgttagat 


gggcaatttg 


gtggtattta 


ttaaaaattt 


aaatacacac 


cactctcacc 


23761 


aattctattg 


ctagaaatcc 


atcttgttaa 


tat act tat c 


caagtatgca 


aagggatgct 


23821 


ctagactcac 


cagtgagtac 


tcatttaaat 


agcaccccaa 


aacaaaacaa 


aacctaaatg 


23881 


tccatctgta 


tggagaggat 


taaaatttgg 


tatagacata 




cgtgtatttc 


23941 


aaaaattaat 


tcctagatct 


tgatatgaat 


ctataaagaa 


acctaaacaa 


tattttgcag 


24001 


aggccacaca 


attcctcttg 


gaaacttacg 


aatgactcgc 


acaaaccata 


cctgtcctac 


24061 


taggagagca 


gctgacacct 


ctagggtcat 


cactaacatc 


acctgatcta 


tgcctaccat 


24121 


caacctcact 


gcactaaggt 


gaagacgtcc 


aatcccctat 






24181 


gacaagattg 


gttgacaata 






tcttagaact 


tcaagatgct 


24241 


tttcccaaat 


atccaggtgc 


atcaaaggaa 


tgggagtttt 


catgcaaaga 


atacagtctg 


24301 


aaacaggttt 


ggaagaggca 




taggtcagMc 




tgggatgagg 


24361 


gatatataca 


ggcattcgtt 


aatgctgcat 




ctctatctct 


atatctgacg 


24421 


tgtttcacaa' 


aaaaaaaaaa 


aaaaagtgct 


cacttcacca 


gcaaacgtaa 


ctaaagcaat 


24481 


atttaaaaga 


tgagtaaaag 


ctagtacaag 


gatggtatcc 


ataaagttgt 


tttaaaatct 


24541 


tatttctaat 


atttactact 


ttcaagttgt 


acaagtgtcg 


tccttgagga 




24601 


taacacaaga 


gcaccataaa 


cagaaagcag 


aaagggggta 


tcaaaagatg 


caagtggaga 


24661 


gaaacagaac 


tgggaagacg 


aaaacaaact 


tcattgcttt 


ttaagatgtg 


ggccatccct 


24721 


aggagcagga 


aagacaacgt 


atcttttctt 


ctgtacctac 


ttcctacaat 


acaaggaggg 


24781 


tccatccaaa 


ggacctaaac 


ctcgtaagtc 


ccattcctat 






24841 


cccaggaatt 


catgaccatt 


tataagcatt 


tccaaaactg 


gtaaatacag 


accactgcca 


24901 


atctgcagta 


tgtattcagt 


atttatgcag 


gctttttgtt 


tttttaagtt 




24961 


tttcatgttt 


taggaaaaac 


atagctagcc 


tattaaaact 


gagctgtgga 


cataattgct 


25021 


taggatattt 


ctaaaacgaa 


tgtttcaggt 




agtgtgggga 


ggcagattta 


25081 




catttaatgg 




ctgtggtttg 






25141 


tgttgaaatt 


taattgccat 


tgtgatggta 


ctgggagttg 


ggaccaggtg 


tttaggtcct 


25201 


agggctcagc 


tttcatgaat 




acagcagtgg 


gttcgcttgc 


tcttctttct 


25261 


ttctctggcc 


ttccaccatg 


ttaagacaca 




tttcatggta 


aaatgctggg 


25321 


gtgaacacat 


ttaggttacc 


gaaagcactt 


ttgqtaccct 


gaatacagca 


aatattatta 


25381 


agactgcaca 


ttaaattatt 


aggaaacatt 


aacttagaaa 


atggttttct 


aataaaaatg 


25441 


ctcccaacag 


caacttaaaa 


actcatgaaa 


caaatcattt 


agaagtagaa 


actctcacaa 


25501 


catYaaatca 


ttacaaaggc 


attgtgaaat 


gtctttagaa 


atatttactt 


acaatttgta 


25561 


acatttgggg 


ctatcccgcg 


tatgaattga 


aaacccttca 


ctcaatcgag 


tatcagaagc 


25621 


aacaattgca 


aaatcttctc 


cagcaattgc 


cagtatagta 


ctgaggaaaa 


aagaaaaaaa 


25681 


ttaattctcc 


agggtggtaa 


tcctatccct 


acaaatagaa 


gaatgctcca 


tagtacataa 


25741 


tgggataaaa 


tactctagat 


gtcaacaaaa 


acatgattca 


aatgggaaga 


ggaaagatga 


25801 


gcgggaagag 


aatgaacgcc 


tggctacgag 


ttgtctggga 




attaataagc 


25861 


ctaaatcagg 


gcaaagtctc 


cttggcagga 


gttaacagaa 


aagccaatga 


attatcatca 


25921 


ccaacacatt 




tcgcgcaagg 


tactactaat 


acagaacaac 


taaataccac 


25981 


atctgtgccc 


ttgaggatca 


ggtatagaca 


gtggtactac 


aacgcaagct 


ctatgagttt 


26041 


agagaagatg 


agattttttt 


ttcttgcttc 


atttctttat 


atccaagtcc 


ttatataacg 
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26101 cctatataat gcttatttct ttatacccaa tcccttatat aatgacaaat agatggacaa 

26161 acagtaaatt tttccctctg tggctgtaca atttgacagc ttatcaaaga gacttacagt 

26221 agaattccaa aagcagaccg cctgggttct aattctggct ttcccgtttc gcagatatga 

26281 gactgtgggt aagttacttc tcaaagcgtc aatttcatca tatatacaac agagatcact 

26341 gcagttgcta cctcattagg gtgttcaaag gatcaaatat gtaagccctt atagcagtcc 

26401 ctgacatgta actggtcctc tagtaagtgt tagctataag tgctatggca ctggagtatg 

26461 actaagcacc tgggctctgg aattacatga gacagagacc cactcttgct acttactagg 

26521 tatgtgatct tggacaaatc ctccaaatgc aagttgatga taacagtacc tgtgtcacaa 

26581 ggtgtgtata tatatttggg tgtgtatatt ttaatgtaca aggcttgact gataactata 

26641 accactgctt caatgcaata gtggaaatta aaggcatggt gcctcacaga cgtaagcact 

26701 caggaaactt aagccactat ttttactgag gagggatttg tgctaaagct ctcaagaaga 

26761 aaaggatggc attccaggta atataaacag caagcaatgg caaacaggta attattcaaa 

26821 tagtacatac attcaagcaa ctcattcagg cagccctttt tgcataagca catgtagtga 

26881 cgttaaggtt tatgtgatgg acagggttcc tactgtagaa aatcccaaat gccaagctaa 

26941 agattttgga attttagcaa gaaatcatga aggtattctg agcaagaatg atctgtagtt 

27001 gtaactactc aagaggctga ggtgggagga ctgcttgagc ccaggtgttc aaggctgcag 

27061 tgagctatga tcgtgcctgg gcattagagt gagacctggt ctttaaaaaa ggaatgcaag 

27121 agagagaaaa gttccattta caaagtgggg ttttaggaag actgctctga caaYaacata 

27181 gtatgtgaaa tgggacagaa acactgttct aatactacta atgcaatagt aaggtagcag 

27241 ggtgaacagt aaatccaaaa tcatcacaaa cacacaaaat agacaaattt ttatatctac 

27301 gcaaatgttt taggaactgg gaaaaccaat tatgacatcc aagatttaga acttagatga 

27 361 acagaatgat ggcataatta taagtatttt aaaggagagg aggccgggca cggtggctca 

27 421 cacctgtaat cccaacactt tgggaggctg aggggggggg ggggggtcaa ttgcctgaga 

27 481 tcaggagttc gagaccagcc tggccaacat ggtgaaaccc atctctacta aaaatacaaa 

27541 aattagccag gcgtggtggc aggcacctgt aatcccagct actcgggagg ctgaggcaga 

27 601 aatgcgtgaa cccaggagtt ggaggttgca gtgagctgag atcgcaccgc tgcactccgg 

27 661 cctgggtgac agagtgagac tctgtctcaa aaaataagaa gaaaggagaa gaggagatga 

27721 aggggaataa ttagcttgct ttttgttttg ctagctgtct tgagttgccc tgagagcaga 

27781 aaaaccagtt aaaaatgttt tactgaagaa gccgaatcga gggactcatg agaggcagaa 

27841 ctggaaaacc agatttggga gtaatcctcc cagcaatgag acatgaaaga gtgctgagcg 

27 901 ataaacaagg cggctaatga cttaactaca tttaaagaca gagtaggaaa agagaatgag 

27 961 gcctcatttt gcggaagcga aggctgcctg agagccagct gcagtaaWca ctaaagaaaa 
28021 agaacaatga ctgagaaaaa gtaatcagaa agatctaagt aatttttagg gcagtaatgg 
28081 cttaaactgg attacaaggg attaaaaagt gagtaacgag tagggcatac tgaacactga 
28141 aaattcttat ttatagagaa tagccttacg aaacgggtcc aataaccctc cctacaatat 
28201 acaacttaat tagtcatcac aggaagtgtt aaggtgtata atggaaaagc atccataaac 
28261 tcagtggtga aatagctatg aattaagtcc tggctcaact tcacaccagc tctctgaccc 
28321 tgacagttta acgtctaata taaccctagg atgctaatat catctaacat tcacttttca 
28381 tgaggattaa ataagatgac agcttgcaat ttacaaaatg catctctctt gattctcacc 
28441 aaaaactatg aagctactaa ggaagataag gaaatttagg ttcaagaagt tcagaagtac 
28501 ccaaagtgtc ctttagtggc agaaccaagg ctaaaatcag actttcgtta tctttctaac 
28561 acactcccaa aatgtgcatt tatatttcaa atttatgagg aaccaattaa catttttgct 

28 621 ttgtttttaa aatttatttt tgtagagatg gggtcttgct atgctgcgca ggctggtctt 
28681 caactcctgg cctcaagcga tgatcctcct gccttggctt cccaaagtcc tgggattaca 
28741 ggtgcgagcc acactgccca gccaatattt tctgttttaa gaaccatcgg ttcgttcaaa 
28801 ttgcgtgtgt atattttaat gtacaaggct tgattggtaa ctataaccac tgtttcaatt 
28861 tacagctctt ccctgtcaag agtcttaaac agagcatctt tctataaccc taaatctctg 
28921 gcgtgccacc acggaaaatt atactactca agataaagct ggtaattaaa ataaaaacca 
28981 aaacttgaac ataacataca agaacacaca tactaaaagg tccatcttct gagtattttg 
29041 ttttcctgaa cttaagctaa acgttaaaaa aaaaaagcac ttatctatga aactaagttt 
29101 gctcagccaa tcccaccttc tatttgaaat aaaacaaaat gattaaactg ctacaattag 
29161 aaataacaga aatcaggcgg ctacaattag acatctcggc taccaaccca gctatgcatc 
29221 taacaacaca gaccaaacaa ccctaacttt taagtttcag acgctaaccc tctaccctcg 
29281 ccggctggca taaRaaacgt gtacatgagg tccagtttta atggtcttcc acagagcaga 
29341 ggctatgttt caatttctac tttactgtct tacagcagca aggagcacgg agtggcggtc 
29401 cacataaaaa ctcaaatgac atgactgtaa tgggaaaccc taaaaaccaa ggctgtatcg 
294 61 caatcaccaa gtaaacttga gcaaagcgag cctgaagagg gaaacacagg gcatgagagg 
29521 acggcaggga gaccggcctt gtgcggaccc cctcagctca gggttctgag gcctgcagga 
29581 gcccggggca gcgccatcac ggcggtgact cctaaatagg cttcagcaga tgggggaagg 
29641 gcgaaagtga aagccgcagc tctctggggt ttttaccctc cgttgaaaac gtagggcgaa 
29701 aatcgcagct gcaaagggcc cgcggctctg tgcggttcca tccccaagtc tctgccagSa 
29761 gccgaataca tggctgtaga ggacaacatc gcacggctgc gcctgcggat ccgacacttg 
29821 ctgtctcacg gcgagatggc tgccttgacc ggacgttacg ccacttccgg cttctcctga 
29881 agttcgctcc cggcctctct atctcacgct agtcgttgct cctggagggc tgcacggcgg 
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29941 cttgtccctt tggtagttga atcccgccca ttccaaaaag cgctgacagg gatgtaaagg 

30001 gttttttttg tttgtttttt gtttttttcc ccctcgaaga aaacattgga attcacccca 

30061 atggacaaaa atttaagtct gaccatacaa aaaaattgtc agaactatgg cgcaacggca 

30121 actcgaataa cggtgggaac gttaattgtc ctggctaata aaaaatgtat ataacatttc 

30181 ctatccttaa agagctcaca acctcactga taataaaaag tacaaagaaa acaagcagta 

30241 taacatatga ttacgccaca atgaactaca gaagggaaaa tcaaggcgtg ctgaagtccc 

30301 actaagaaac aactgcggaa agagccatgt gacaacagtg catgaactgg gagtggcaga 

30361 actgaatata aatgcatgtg taaacacaag ctgtttgttt tgcttagtgt tccttgtcat 

30421 tctacacgct tgaagatcag ctagcgttct tgctgacagg taaggaggac gcgcttactg 

30481 agtgccaagc actgctcagg cactgattct gtcaatctct gtcaatctcc cgacagccca 

30541 agggtaagca ctgttatcat tattcaattt tacagaaaaa aaatgcgggg gagaggtcag 

30601 gtaacttgtc gaaggtaacg ccgctagttg ctttaaacaa caacaacaac aacaacaaaa 

30661 cacactcaca catatacaca cacacgccat ttaaaaatcg atctttccta cgtccagcaa 

30721 gggccaatta gagatggctg tggcacggcg gccccgcccc ggaactcctc aagagcttcg 

30781 cccctcctta cctatggaaa cacaggaagt gacctatgct cacacttctc aYggcctcgg 

30841 ccctagtggg agcaactcgc tgaagccgag ggcagaactg gcggaagtga cattatcaac 

30901 gcgcgccagg ggttcagtga ggtcgggcag gttcgctgtg gcgggcgcct gggccgccgg 

30961 ctgtttaact tcgcttccgc tggcccatag tgatctttgc agtgacccag gtaacagatt 

31021 gtactctttt ctgacggttc gggcgaaggc caccactgca ctgaggcctg ggggcaatgg 

31081 tggggaagag actaggaatt ggcgcgcgtg caggcccctc gggggacgtt cctccctttc 

31141 gtgctgccgc cgttccggcc tgtaacggcc actcggccgc cactcccgcc tggtgcccta 

31201 ctctgctgtg tttcgcaggc agcttcccat cgtacgattg tggggctcag ggtactactg 

31261 gctggctggg cggcggcagg cgggacagga cagtcccttg catcgaagac cctaagttta 

31321 ccctgccctg tcctgccatc cgcttcttct ccatgttaga agcagattca cccagatctg 

31381 tgcccgcctg ttttgctgcc aacattgaga cttaaatatt ttgtcagaag cctgagacag 

31441 cgggcacggt agcgcttaag atataataca caccacttta tttgcagggt ctcccgtctc 

31501 tcggttcagg ccatcatggt tttccaaatc tctaggtaga cttttctgtg aaaagactgt 

31561 gcttcattta gttatacaga cactagaagg ctatgcagaa ttaatttgat tgcctccaaa 

31621 aaatatcgga tttgatgttt caatttccag gagatgaaga tacccagcaa acaactcttt 

31681 tctgaggata aattagtgca gtaatcactg tgcggtttct tctgtagact tacttgcaaa 

31741 aagtggcctg aagccaccga aggtcctgga taaatctcta atcatactta taatggcttt 

31801 aaatcctgcc gtcattatct cttgcctcaa ccttagattc ctgaaacgaa acttccgtcc 

31861 tccagtttta ctcctctcaa attcatctag tcttgccaaa ttagatctgt tcatactgca 

31921 cttccgaaat tccataactg ttattattgc ctatgcaata acattgaaaa ctcctgatag 

31981 tatgagccca ccaatatgtg ctgtctcatc tgctgcagtg accttctata cagtcatact 

32041 aagcttgttg cctgcatact gcatgctttt tcaatctgtc tctttctgct tgatttctct 

32101 tttgtctgaa gccctgatgt gtaaattcct actcaccttg tgagacccaa gttagatggt 

32161 ccctgctttg tgaaaacact gcgctctctt cacagtgatt ggctgttagt ctatattgtc 

32221 ttctcttcca ggggtgtata tgggctcatt catgatcaca tactgtattc caggcatagt 

32281 gctagatgca gagatcacaa agacatgtag gctggtttct gcattcaagg aacttagctt 

32341 agaccatacc tgctgttata atactatgtt ttacagtagt tatttgcata cccttcatat 

32401 tgaacacttt gatgccaagg actatatcct cctatcttta tatcctcatc tgcaggactt 

324 61 ctgttattgt tattatagga taactgtcaa aaaaaaagta tattttaaaa aatatctctg 

32521 atatatttat ttccagaagc agagcttgct ttcttttttg gtctgttttt cagtgatgag 

32581 tatgtaggat agatagtctt tgggggcctt tgccctttca aagtgatcgt cagagtcttt 

32 641 catacattca gcaaatatct gagtgtctgt tctgtaccag cacatgcttg aagtgcatat 

32701 gcctgaagga tctttggaca tataatttgt aactttgaga cctctaagtt ctatgtgaga 

327 61 atatgttgtt ataaactcat ttcagatgtg tagtgagtaa agcgatgatg atttaagaaa 

32821 agtcagataa caggcacagt ttgcattaat gtgttctaaa gaggtaaggt tattacattt 

32881 ataaaaattc agggctttat ctttgtgcgg cttttttttt tttaacagtt tcattacagt 

32 941 aggagcttga taaatgatca ctctgaagta tattggattg aatttgatat ttacttaatt 

33001 ttttgcccaa gacattgtag aggatgtaaa attggaatat ttaaagatct aaactttgcc 

33061 taacagtgct gtgtatacag tgcttagtga atattctgct ctgatattac attttgctta 

33121 ggaattattt ttctctaggt gtttttcctc aaaagttttt aatgctggtt atgacagctc 

33181 gattttgagc attttccgat tatttaaaca tgtaacaaaa tgatttttgt tttgttggcg 

33241 attttacatg caatcgccgg aaacatggaa ggaataaaac tttaggatta taaggtaaaa 

33301 acaaatgtat tccaaaatag cttcattggt tttcatgttt gtgttttgta tagccataga 

33361 actggcttat aggactgtac aggttacctg gatccttaaa ttaaacttta gacttttttc 

33421 caaagcagca tcactgtttc ttggcgtgtg aagataaccc aaggaattga ggaagttgct 

33481 gagaagagtg tgctggagat gctctaggaa aaaattgaat agtg agacga gttccagcgc 

33541 aagggtttct ggtttgccaa gaagaaagtg aacatcatgg atcagaacaa cagcctgcca 

33601 ccttacgctc agggcttggc ctcccctcag gtaatatagc aggagggaga gaatagggag 

33661 ggcggaaatc tgaactgcaa gagatggtat caaaaggcaa ggaagggcat ttaatgatct 

33721 gtttttgaaa atggtttaat atgtttttta agccttattt tgttgagaag ttctattagg 
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33781 ctttgaatag gcacaatggt gtttatttgg gaagtctgga agctaagtta tatatttatg 

33841 aaacacctaa tcttttgata aacacttatg agaatgttcc aaatgactat atagggtctt 

33901 tgattttgaa aatcaccctc accaaattat tttctagttt tatttttccc cttttacatt 

33961 tcaagctctc aaccacccat ctactttaaa aatttttcag cctgggtgca gtggctcacg 

34 021 cctgtaatcc cagcactttg ggaggccaag gtgggcggat cacctgaggt caggagttcg 

34 081 agaccagcct ggccaacgtg gtgaaacctc atctctacaa aatatacaaa aattagccag 

34141 gtgttgtggc gggtgcctgt aatcccagct actcgggagg ctgaggcaag gagaattgct 

34 201 tgaacccggg aggtggaggt tgcagtgagc agaggtcgtg ccactgcact ccagcctggg 

34261 agacagagcg agactctgtc tcaaaaaaaa aaaaaaaaaa aaaaaaatca ataacttttg 

34 321 ttgaaatata catacaaaaa aactcatttt aagtgtctgt tttgataatt atatataatg 

34 381 tataaattac agaacatttc agttacctta aaaagtccct tcttcccctt tatagtcact 

34441 ctgctggccc caggtaacta ctgcatctgc ttttcaatgc tgaagattag ttttgtctat 

34501 tctagaattt catatagatg gaatcagagt gtatgctttt ttgtgtatgt ctgacttctt 

34561 agcccagtgt actgtttgta tatcagtagt taatccattg tatagctaag tatcactcca 

34621 ttgtttggat gttccacagt tcatccattc tccagttgct cacatttggg gtctttccag 

34681 tttggagcta ttgcgaataa aagcactgta aacatttgtg tagactttga atgcactgtt 

34741 tttacttctc atgggtaaat acttaggagt aggattgcta ggtcctatat tggtatatgt 

34801 ataactttat aagaaactgc caaactgttt tttgaagtgg ctgtattgtt ttgcagcata 

34861 agagatttaa gttgctccac atcctcacca acactttctg ctgtcagtct ttttactttc 

34921 actctagtga ttgcttagta gtatcttatt gtggttttga tttttatttg cctgattact 

34981 aatgtttctg agcaccttgg caagtgcttg tcagccactc atacacaggc ccacctcact 

35041 ttactgcact tcactttact gcatgtttga caaattgaag gttgtggtaa acctgtaccc 

35101 agcaagtctg ttggcattat ttttccaaaa gtgtgtactc acttcatgtc ttggggttag 

35161 attttggtaa tttttgcagg atttcaaact ttttcattat tatccgttat tgttatcaaa 

35221 cttttttttt tctttttttt tgtatttttt ttttaatttt ttttttcagt cactctgttg 

35281 cccaggctgg agtgcagtgg tgcagtcttg gctcactgca aactccatct cccgtgttca 

35341 agcgattctc ctgcctcagc ctcccaagta gctgggatta caggcatgcg ccaccacacc 

35401 cagctaaatt ttgtattttt agtggagacg gagtttcacc atgctggtgt caaactactg 

35461 gctgcaagtg atccatccgc cttggcctcc caaagtgttg ggattacagg catgagccac 

35521 tgtgcctagt caaacctttt cattattatc tgttattgtg atcagtgatc tttgatgtta 

35581 ctattgtaat tattattgaa tgccataaac tgcacctgta taatacagga acttaattga 

35641 taaatgtctg cagcgtgacc aaccattccc ccatctctct ccctctgctt ggtcttccct 

35701 attccttgag acacaacaat atggaaatta ggctaattaa taaccttaca gtgactttta 

35761 attaagtgtt cagatgaagg gaagagctgc acatctctca tttgaaacca aaagctataa 

35821 atgattatac ttagtgaaga aggcatgttg aaagctgaga caggctaaaa gctaggcctc 

35881 ttgcaccaaa tagttagcca agttgtgaat gcaagggaaa agttcttaaa ggaaattaga 

35941 agtgctactc cagtaaaccc acaagtgata agaaagcaaa acagccttat tgcttatatg 

36001 gagaaagttt gaatggtctg gatagaagat cataccaccc acacatcccc ttaagccaaa 

36061 gtgtaataca gagcaaggcc ttaactctct tcaattctgt aaaggctgag agcggagagg 

36121 aagctgtaga agaaagtttt gaaactagca gaggtttatt cagagcttta aggagagagg 

36181 ccttctccac aacataaaaa tgcaacatga agcagcaagt tatctggaag atctagctga 

36241 gataattgat gaaggtgact atactaaata atagattttt aatgtagatg aaacagcctt 

36301 ctactggaag aagatgctat ctaggacttt tatagctaga gaggagaaat cagtgcctag 

36361 cctcaaagct tcagaggact ggctgactag ttaggggcta atagagctgg tgagtttaaa 

36421 ttgaagccaa tgctcatgtt ctgctctaaa accatagagc cttaagaatt acgctaaatc 

36481 tactctgcct gtcctcagta aacagaacaa caaaacctga tgagagcacg tctgtttaca 

36541 gcatgattta ctggatattt taagctcttt gagatctgct cagaaaaaaa gtttaatttc 

36601 aaaatattac tcactgacag tgtaactagt tgtccacaag ctctgatgga gaagaacaag 

36661 gagattaata ttgttttcat gcctgcttaa ataatatatc cattcttcag cccatggatc 

36721 aaggagtaat ttcaactttc aattcttact atttaagaaa tacagcaggg catggtggct 

36781 catgcttgta atctcagtac tttgggaggc caaggtggaa ggatcactca aagctaggag 

36841 ctcaagacca acctgggtaa caaaacaagt ccctgttgct acaaaaaaaa attttttttt 

36901 aattagctgg tcatggtggc atgtgcctgt agtcccagct acttgggagg ctgaggcggg 

36961 agggtcactt gagtccagaa gttaaaggct acaatggaga ccctgtctca aagaaaggaa 

37021 gcagggaggg acatatgctg tagctgccat agatagtgat tcctctgatg aatctgggca 

37081 aagtgaattg aaaaccttct gaaaaagatt caccattcta gatgccatta agaatgttca 

37141 tgattcatgg gaggagggtg aaatatcaac atgaataaga gtttggaaga ggttgattcc 

37201 aaccctcgtg gatgactttg agaggttcta gacttcagtg ctggaagtta ctgcaggtgt 

37261 agtggaaata gcaagtgaac tagaattaga agtgaacctg aagatgtgac tgaattgctg 

37321 caattttttg ataaaacctg aacagatgag gagttgcttc ttgtgagtaa gcaaagaaag 

37381 tggtttcttg agatagaatt gactcctggt gaagaactga tgactttaga atattacata 

37441 aacttagttg ataaagcagc agcagggttt gagaggattg actccaattt tgaaataagt 

37501 tctagtgtgg gtaaaatgct gtcaaatagt atcatatgct acagagacat cttcagtgaa 

37561 aggaagagtc agtcagtgtg gcaaacttcg tcagtcttat tttaagaaat tgccacagct 
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37621 accaccctga tcagtcagca gccatcaaca tcgaggcaag atcctctgtc agcaaaaaga 

37681 ttatgatttg ctgcaggctc acatgattgt tagcattttt agcaataaag catttttaaa 

37741 ttaagttata tacatattat tagacataat gctattgcac acttaataga ctttagtgct 

37801 aacataactt ttgtaggcac tgggaaacca aaaaattgat gccgcttgct ttattgagat 

37861 ggtctggaac ctaacctgta gtatctccga ggtatgcctg tatcttcatt tgtaatatgt 

37921 ccttcacatc ttttgccctt ttttattatt ttatttgttg atcttctttt atggagttgt 

37981 cagagctctt tattattctg tttaccagtc ctttctcaga tgtatgtatt atagttattt 

38041 ttttcccagt ctggcctgcc ttttaatttt ctcaatggtg tctttcaaag aacagaagtt 

38101 tttaattttt ccgaagttca gtttatccat ttttcttcat gtttatccac tgtgtggtat 

38161 taaagaagaa agcaatgtgt ataagaatag ctggttcttc cgtaattaat gtttaataac 

38221 cccattattc tccgaaggca tctgtctttg cacacctgac ctgctgttcc accaagaaag 

38281 ttccacaaac acttagcagc agccagccta acctgttttt ctccttgctt tccacagggt 

38341 gccatgactc ccggaatccc tatctttagt ccaatgatgc cttatggcac tggactgacc 

38401 ccacagccta ttcagaacac caatagtctg tctattttgg aagagcaaca aaggcagcag 

384 61 cagcaacaac aacagcagca gcagcagcag cagcagcaac agcaacagca gcagcagcag 

38521 cagcagcagc agcagcagca gcagcagcag cagcagcagc agcaacaggc agtggcagct 

38581 gcagccgttc agcagtcaac gtcccagcag gcaacacagg gaacctcagg ccaggcacca 

38641 cagctcttcc actcacagac tctcacaact gcacccttgc cgggcaccac tccactgtat 

38701 ccctccccca tgactcccat gacccccatc actcctgcca cgccagcttc ggagagttct 

38761 gggattgtac cgcagctgca gtgagtactt cgtgttttat gtttcctccc acttaggagt 

38821 ccctttgagt tatgttcctg ctctgttttc agatggatcc ttttattaag ggagggagtg 

38881 gcactaacgg taattgtgta tcaaaatttg ctttatctca catttgggaa agggaagcaa 

38941 agctatctta gtcagtgtcc tcagtaaaag gctcttaaca ggtttagaaa tgtggtcatt 

39001 tgtgtttaca tacctgagcc aataaaattt aatctgactt tcactgtcgt tattattata 

39061 ttatagacat ttccctgtat ctgatatcgc taaatcacaa tgttaggtag tctctttccc 

39121 ttatgctatt ttaggtcttt agtcacaata tcagtataat ttctggtagt tcttgttttt 

39181 tgtttttact atgggtgctg catataaatc tttgaaggtc tgtgtccttc ccacaaaatg 

39241 aagacgactg tttttgtcat aaatggattt ttctacctaa atgaagtggg ttctatatgt 

39301 aacagtgtag taggggtagg aaataatttc atcttcctga aaaaccagca aatatttcca 

39361 aataattcca gtcaataaga agtgtatagc ttttcatttt agaaagctta tgaccaaatt 

39421 aaaaggttac ttgcagtctc tgcatctcct cagttttctt gtacagatac cttcctcctt 

39481 catactccct ttagatctag tatttcctat ttgcatttat tcaccttgtt acatatatgg 

39541 ttgttgttta taagctatct ctcatcattt ttggaactaa gtggtgtgta atcgtggatg 

39601 ctgactttca cccaaagacc taaatcttgc ctccacaatt ttgtttgacc ccacacacaa 

39661 ttttaaaaac aaagtaattt tagtgatttt agagagagcc ctttttagct cacccttgtc 

39721 tataccactc ccatgtgtcc agcatgcttg tctgactccc aggaatattt aagtttgcat 

39781 ttctgattta aataataaat tataaacagt acagtcaggt gtttgtttct tgcgagtgcc 

39841 tttgctgttc tgatactccc cagagcatcc agtcatccac attcctgaat ttccactatc 

39901 tactaagaac cctcccgggt ggtagttcag tgccctctcc atcttgtagg cttaagaggg 

39961 attaaggtaa atatttaatt aatgaaaata tataagaata catttaacaa ttatgtgact 

40021 aaagttgtga tagatatact aaaatctact tggtatagtt cataagtgga caaatgctaa 

40081 aggcaggtgt gtatacaagg gtgattggaa ggaaatagat gatatggaaa tgaaaacatt 

40141 cattgtgtgg atcaacctaa ttggctttac agttgtgtat catagactgt acaagaaaaa 

40201 gaaccagaag caacaatgag ctgaaaccta ttgtatttga acctatgtat ttgaaatctg 

40261 gactcgatat tcttgggatg gtgatttcca ggtccatgcg agacagttgt tgagcagtta 

40321 attgcattca aagggcattg cccatttcaa ctatatatac tttcataagc agaaaaatgt 

40381 ataatgaaaa tgtgggtttg ttgtttaaat ttttattaaa atttctaata gagttgcttt 

40441 ttagtatatc tttatccttg tataagttat tcgttaagtg atttaagaac actgagaaat 

40501 gaaaagggtc agtggaccca gatggctcca tttatcctta tttactgaga gttaattaga 

40561 agagaaagca gggttttgcc tttttttttt gctaaagaca cttagctctt gtttttcaaa 

40621 agatacttac tttgagagaa ctgggcagtt catgcctgct tgttttcatt ataaatagtt 

40681 acttaggaat agggtggtgg tggtagcttt tcagagctca cagatgagaa ttgagtttgt 

40741 ctgtgagaag gctaagggca ggctctggag acagacagac tcaaatcctg gctcctccac 

40801 ttgggagcca ttgcgtcctg agcaagttgc actctacctt taagcctcag cttatttatc 

40861 caaatgtggg aatactggta acttctacct cagtgcgtta tatgtattaa atggaataat 

40921 ctatataaaa tgctcaacat gtgcatgata cttgttaaac aataaatgtg agcaattaaa 

40981 aaaaagtaaa gcctggttga aataatcaga tgtctgcata atttctaacg cctcatccaa 

41041 tgaaacttaa gtaatttaaa tagtcgtgtt ttctttttaa atctcttaca gaaatattgt 

41101 atccacagtg aatcttggtt gtaaacttga cctaaagacc attgcacttc gtgcccgaaa 

41161 cgccgaatat aatcccaagg ttagatctat tttaatgtat ttcttttttt tttctttttt 

41221 gtctcctctg ccgtgtctct atattttaat gtatttcacg ctataaaaca aatgtctgta 

41281 gatcaggcca ggcacagtgg ctaatgcctg taatcccagc actttgggag gctgaggaag 

41341 gagaatttct tgaacctagg agttcaccac cagcccaggc aacatggcaa gaccctgtct 

41401 cttaaaaaaa aaaaaaaaat tggccgggct tggtggctca cacgtgtaat cccagcactt 
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41461 tgggaggccg aggtgggcgg atcatctgag gtcaggagtt caagaccatc ctggccaaca 

41521 tagtgaaacc ccgtctctac taaaaataca aaaatttgcc gggagtggtg gcatgcgcct 

41581 gtaatcccgg ctacttggga ggctgaggca ggagaatcac ttgatcctgg gaggcagagg 

41641 ttgcagtgag cggagattgc gccattgcat tccagcctgg caacagagcg agactccgcc 

41701 tcaaaaaaaa aatctgtata tcaaagagtt tgtgttatgc ttattccttg acaccaataa 

41761 aatgaagatc taaagtaaaa tgtgcattgt tttgtatctt ttattgagtg tctgtgatat 

41821 aagaatccct gggagtacag cagtctataa aataagctaa gttatatatg tattgtattg 

41881 ggatcatgtg aagaattttg tatatgtata gggatgatgt tagccagcta tagtaggcaa 

41941 agtaatttgg ataaacatta gctgggttga gagaaatgtt aagtaatgga cttagggtgc 

42001 attttaaaat ccctgaaaca taatgcctga gatctgaaag acagctaagg gtctgaaaat 

42061 caccttttaa ccactcataa tgagtctgta tgacatgtag cattctaaga attaatcttt 

42121 catctattat aatcacattt agttgaatac ataattttat tactttacgt tcttaagtga 

42181 tatttaacca ataaaaatag tagaggaaat tctggaaatt caaaatagct aggcttttcc 

42241 tgggacgtat ataaactact atatactgct agctttttta gatgttgata aaatcacagg 

42301 gaagacatag agcaggtttt aaactaaaat tgttaaaagc caggcatggt gactcagacc 

42361 tgtaattcca gcactttggg aggccaaggt gggaggatca cttgaggcca acagttcaag 

42421 acgagcctgg gcaacatagc aaggccttgt ttcatcagaa aatttaaaaa attatctggg 

42481 gttggtggca cacacctata atcccagcta cttgggaggc caagacagaa ggatctcttg 

42541 agctcaggaa ttcgaggctg cagtgagcca tgattgcacc actgcactcc agtctgggca 

42601 acaaagtgag actcttgtct ctaaaaaaaa aaattaataa aataaaaaaa taaaattgtt 

42661 aaagcaatct agtttgtcaa agagtcgtct taattagaaa catgtaggtt tttattgtat 

42721 aattaaatgt attaaaatat ttaagttgtt tcaaagtaat tatttgcttt ttatcatgta 

42781 cctaataact gaggccatta attcaaccca cagttgaaat attttcccct tgagttacca 

42841 ccttagcaag aaaattttct gaaagaatta acatgtttca agcataaaaa gcagaaaata 

42901 aaagttaaaa ctgtgttttt ttcctaattt gcatttatgt gtttaataca ttaatattgt 

42961 ttctaaagac caattgaaag catgatttta cttcattacg tattgatagg cttgtaatct 

43021 gatttattaa ttccattaag gtagaaaagg tttattggtt tatttgcctc aaaattcata 

43081 ttaataaaac aaaatgtatc agagaagcta tagtcatgat aagcagcaaa tataaagtag 

43141 aaacagaagc acttgtattt attgagaaca taaccaaatt ttgcaaacat ttctcttgat 

43201 ggaatgctca tgtgaaagaa catacagatt tggtataact tagtcacaat tgattatttt 

43261 ctgctcctta tgcaaaggaa tatttgttac atgtggtgta tgcaaatcct ttacagatat 

43321 acagaaatac agaacaaata ttttgataac tagatgtact atgtccttcc taccagttgt 

43381 gatttttttg tcaatgggtg gttcgcctaa caacattgag cagtttggca tgacctcact 

43441 aatgattctc tctgaccatt gtagcggttt gctgcggtaa tcatgaggat aagagagcca 

43501 cgaaccacgg cactgatttt cagttctggg aaaatggtgt gcacaggagc caagaggtag 

43561 ccgtaagaaa ttcattcttc tggtctatgg gttatgaatg aaaaggtgat atctcattgt 

43621 ttttaggtta ttaggttagc actttaacat gttattattg ctttcttata aaaaccattt 

43681 taatatgatt ctattaatta tttttattta tttatttatt atttgattct attaatgtta 

43741 gatagcagtg atttcatttt cttaaaaatt agatatgggc aggcgtggtg tctcaccaag 

43801 gcgggcaaat cacttgaggc caggagttcg agaccagcct gaccaacatg gtgaaactcc 

4 3861 atctctacta aaaatacaaa aattagccag gcgtggtagt ggatgtctgt agtcccagct 

43921 acttgggagg ctgaagcacg agaatcgctt aaatccggga ggtggaggtt gcagtgagct 

43981 gagatcatgc tactgtactc cagcatggga gacagaacaa gacactttct ccagaaaaaa 

44041 gaaaaaaatt aggtaagatg taatacaaca tcagaactat taataaaatt ccctttatgt 

44101 gaaaaatgtt gtaaataaca caaatacatc ttgaaatgga agaaaaaatt gatcaagaat 

44161 taaaacactt ctaagctaat ggtgtaaggg ctttgcagac cttattgggc ctggcagcgt 

44221 gacagcatca catgtgttgg ctgcagaaac ctgccttctc actggaacta gctttcatta 

44281 atagtgtttg agttgtgcag atcttaagga atgcaccctt gttataaaat gtcattgcct 

44341 atattctcca tctgaagtga tcctgtctga aggttagtac tctgcggccc tgactgttct 

44401 aaacagagct tatatacttg gtagaagtca aaaactaggg ggaaaattag tgacattagc 

44 4 61 ttcataaatg aactgaacaa ataaactaaa actttttaaa tgaaagagta tKaaatgtac 

44 521 ttggaggtat caacacatag aagggggctg tggatatttt tgcttattct gaagtttgaa 

44581 attgtctttc agtgatggga atgccattgg tgtggacatt ctggctcctg atctccgagg 

44641 tttttcagaa tgcaggacaa ggatgtgttc ttgcctttca cttccccttg gccacaacat 

44701 gcagtagtaa cctctttaat taagagcgtt tttgtttgct tgttttcccc tgcatggaac 

44761 atcagaaact tttggtttat caaggcaagc ttttcatgca gcatttagcc tttttgtccc 

4 4821 agagcatctg aaaactgaat attgtatatS tagttggatg actatttcat caagacaaga 

4 4 881 aaccaccaat acatttacct gaaatttaaa agcgtagcat atatatatat gcttgatttt 

44941 tgttcttcct gatcccctgc acctgatagt ctctttatgg taatattttc acattctttt 

45001 ttgttgttgt tgttgttttt gagatggagt ctcgctctgc gcccaggctg gagtgcagtg 

45061 gcatgatctc ggctcactgc aagctccgcc tcctgggttc acgccattct cttgcctcag 

45121 cctcccgagt agcagagact acaggtgccc accaccatgc ctggctaatt tttgtatttt 

45181 tagtagagat ggggtttcac tgtgttagcc aggatggtct caatctcctg acctcatgat 

45241 ccgcctgcct cagcctccca aagtgttggg attacaggtg tgagccaccg tgcccgacct 
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45301 gcctgcctgc ctttttctct tttatttttt ctttttttct cttctttctt tctttccttt 

45361 tctttttttt tttttttttt tttttttttg gtggagtcac ctaggctgga gtgcagtggc 

45421 acaatcttgg ctcactgcaa cctccacctc ccaggttcaa gtgattctcc tgcctcagcc 

45481 tcccaagtag ctgggattgc aggtgcctgg caccatgccc agctaatttt tgtattttta 

45541 gtagagatag ggtttcacca tgttgaccaa gctggtctcg aactcttgac ctcaggtgag 

45601 ctatccgcct tggcctccca aagtgcttgg gattagaggc gtgagccacc atgcctggcc 

45661 cacattctgc ttttcttatg taagctctga actgctaagt cgtagtttat tcaacaaatg 

4 5721 acataggaat gtctattcat gatgagtctt ggtataaaag aggatagaat tagtgaatac 

45781 attgttcaat aataaatctc atcaacattt tctgatcaaa atgaagtttg ttagttttcc 

45841 tctcagtaga aatgcatggg ctaaaataca gaaatagtga tgaYtgatga tggtgataat 

45901 tcacatccat aaaatctagg gctacaataa tttggcggat tgaaaggtca ttttggcagg 

45961 cctacagttt tctgtcaagg atccaggaat actttataag gaattgtgaa tgcctgtcag 

46021 tcttttctcc tattgcaaga aggctgacca gtttacactt tattagttta ctgttttgga 

46081 ctttttataa gttattagtc taaataagta ttttagctgg ctctgagtat gaataactca 

46141 cttttttcct ttccctagtg aagaacagtc cagactggca gcaagaaaat atgctagagt 

46201 tgtacagaag ttgggttttc cagctaagtt cttggacttc aagattcaga aRatggtggg 

46261 gagctgtgat gtgaagtttc ctataaggtt agaaggcctt gtgctcaccc accaacaatt 

46321 tagtaggtaa gtctgaaatg tattRtgatt gttattggca acagttcatt tataatctaa 

46381 acattgttca gaataaaaca catgcaaaat attcagtata tgagaacagt tgacatggtt 

46441 atagttgtat gtattcttgc attgtcttcc tgatgttctc agtcatattt atcaccctca 

46501 ccagcctctg cttcccttat cactttgcgg tacccataac tcccctttac tggaatgaat 

46561 ttgattctac ttctgtatgt tttatcgttt tattgctgaa tacacttgga atgcatgaat 

46621 tgaccctaac cttgtatcaa ttttttttcc cagtttggat cttctctttt agtcaacatt 

4 6681 gtgtcagatc taccagcaaa gtgtgaagtt gagcgatagg aacaactttc taattatctt 

46741 ccctgctact tgcaggtgaa gactcacagg caggcagccc tgcccacctc actgcttcat 

46801 ctcatggtct tctggtgtct gttcacaggc acattagttc tgtgtgcccc tggggctcac 

46861 ttggttcctc ctgccctagg cctttgcact tgccactttt atgcctagta tgctcatttc 

46921 ccttttcacc taggaagctc ccacttgagt tccatatgtc tacccattcc actcttcctc 

46981 agagaagcct gtcctgaatt ttgtcttagg tcagacacac agaaacagag gcatcctgtt 

47041 ttcagtctct ccgaactgag aaaaatgaga ttactaggct tagaagaata tgtaccaggc 

47101 ctggcctggt ggctcacacc tgtaatccca gcattttggg aggccgaggc aggcagatca 

47161 gttgagttca ggagttcaag accagcctgg ccaacatggt gaaacctcgt ctttactaaa 

47221 aatacaaaaa ttagctggac atggtggtgc atgcctgtaa tcccagttac atgggaggct 

47281 gaggcaggac aatcattcga acccaggaga tggaggttgc agtgagccaa gatcgtgcca 

47341 ctgtactcca gcctgggcga cacagcaaga ctccgtctca aaaaaaaaaa gaatatatac 

47401 caatagtcca ttcagtcaga cagcttaatc aggtataggt taattctcag gctagtatat 

47461 aagtttgatt aaatttcctg accacaattg tcagctagag aatatttcaa tttaaggagg 

47521 taagatatga ttaaaagtta aactgtcagt attggatctt agaagtaaat gattattagg 

47581 actgtaatag taattattag gactgtaaaa gtaaaggatt attatctgca ttagatatca 

47641 ttatatctaa tgatatagag actgcagaca taactacagg gctctttttc ttaaatcaga 

47701 aaatccagat tcaatagaaa tagggtaaag tgataggagg acaaatagcc ttccatccag 

47761 tggttatcaa ctgacgacta caagtcggcc tcacttgctt taattattct attctatcct 

47821 ttgatgctgc ttgaagaact gtgttttacc tcttgactag tttgtttatt cagtattttt 

47881 ccttgtacag gtcctcattt tatctaaaag cacacaaagc tcttgatttc taaacttttt 

47941 gcaattttcc ttctagttat gagccagagt tatttcctgg tttaatctac agaatgatca 

48001 aacccagaat tgttctcctt atttttgttt ctggaaaagt tgtattaaca ggtaagttgt 

48061 aacaggaagt agtatctgaa agtttgtaag tgttttgagt atggcatttt ctcagtgctg 

48121 aaaagaaatt tcagtgttcg gacagtgggc tagcttcttg tacaaaggcc tcccacccaa 

48181 agtctgatga gaaacgtgcc cactaaaggc acagtgagag cagggaagtc tgaccacagc 

48241 tctgcaagca gacttccatt tacagtgagg aggtgagcat tgcattgaac aaaagatggc 

48301 gttttcactt ggaattagtt atctgaagct ttaggattcc tcagcaatat gattatgaga 

48361 caagaaagga agattcagaa atgagtctag ttgaaggcag caattcagag aagaagattc 

48421 agttgttatc attgccgtcc tgcttggttt atggcctggt tcaggaccaa ggagagaagt 

48481 gtgaatacat gcctcttgag ctatagaatg agacgctgga gtcactaaga tgatttttta 

48541 aaagtattgt tttataaaca aaaataagat tgtgacaagg gattccacYa ttaatgtttt 

48601 catgcctgtg ccttaatctg actgggtatg gtgagaattg tgcttgcagc tttaaggtaa 

48661 gaattttacc atcttaatat gttaagaagt gccatttcag tctctcatct ctactccaac 

48721 ttgtcttctt aggtgctaaa gtcagagcag aaatttatga agcatttgaa aacatctacc 

48781 ctattctaaa gggattcagg aagacgacgt aatggctctc atgtaccctt gcctccccca 

48841 cccccttctt tttttttttt taaacaaatc agtttgtttt ggtaccttta aatggtggtg 

48901 ttgtgagaag atggatgttg agttgcaggg tgtggcacca ggtgatgccc ttctgtaagt 

48961 gcccaccgcg ggatgccggg aaggggcatt atttgtgcac tgagaacacc gcgcagcgtg 

49021 actgtgagtt gctcataccg tgctgctatc tgggcagcgc tgcccattta tttatatgta 

4 9081 gattttaaac actgctgttg acaagttggt ttgagggaga aaactttaag tgttaaagcc 
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49141 


acctctataa 


ttgattggac 


tttttaattt 


taatgttttt 


ccccatgaac 


cacagttttt 


49201 


atatttctac 


cagaaaagta 


aaaatctttt 


ttaaaagtgt 


tgtttttcta 


atttataact 


49261 


cctaggggtt 


atttctgtgc 


cagacacatt 


ccacctctcc 


agtattgcag 


gacagaatat 


49321 


atgtgttaat 


gaaaatgaat 


ggctgtacat 


atttttttct 


ttcttcagag 


tactctgtac 


49381 


aataaatgca 


gtttataaaa 


gtgttagatt 


gttgttatac 


cttgtaagag 


tcatgtgatc 


49441 


atactgtttt 


ctacaaagtt 


gtattttaga 


tataatgcct 


gaaaccattt 


tggtgtttgc 


49501 


ttcagtcagt 


atttcattgt 


atgctgcaat 


gaaaacagat: 


taatgatctg 


agaaccttcc 


49561 


atatattgag 


caactcctgt 


tttctaggta 


ttttgcatac 


aatgcctigga 


atcctcacaa 


49621 


agcttcagtt 


acgttttgtt 


cctctgttgg 


aggtgaggag 


ataaggaagc 


cccagctgag 


49681 


ggacttggct 


gaggttacac 


agctagtaag 


tggtaaaaat 


gagtgagtcc 


ttcaggtgta 


49741 


gaagctggtg 


ccctatccac 


aggctgccaa 


ctctctgcag 


taactttttt 


ttgcttgttt 


49801 


tgcagttttt 


ttctcatgga 


ctatcaggtg 


gacatttgtg 


ggttcttagg 


ttttattgtt 


49861 


agagtggttt 


gttgttttta 




tacatcttca 


gctgactcag 




49921 


agaaagggga 


ggtccttcct 


tcctttctct 




tccccaagat 


aacgaccaga 


49981 


ttagtgggta 




cctgttctgt 


ctatcttctg 


ggcatggcat 


ttgatcccat 


50041 


ttttggaaaa 


aagaaataat 




ttatagcttt 




agtcaatata 


50101 


tcataaacct 




ttgggaatct 


aggatactct 


tcttaggtac 




50161 


tgagctattc 


ctgtaattat 


ggtcatttac 


atgacctcta 


gtttttcaat 


atcatcaaca 


50221 


gcgctgcagg 


gaacatcctt 


gtatgtgtat 


ttttcggatg 


agtgcaggta 


ttcatgtagt 


50281 


aaactttcct 


agaagaggaa 


ctaaagacta 


tacattctga 




gttgatacca 


50341 


gattgcagtc 


caaagagaac 


ggtgcatgaa 


atgccctttt 


cttgcatttt 




50401 


gtgatgccaa 


taatgtatgg 


attaattgga 


aagacatcat 


tccaatatta 


tctcctatct 


50461 


aagaaagaag 


tatctcttca 


tgtatgtagg 


tcttatgttc 


ctcagtaaag 


ttttgtactt 


50521 


tttttcacac 


agggctttta 


tatttcataa 


gtattacttc 


cttaattcct 


aagtatagta 


50581 


tcatttctgt 


tttattactg 


gaagtttttc 


ctaacacttt 


aaaagtta'tt: 


gttgataata 


50641 


tttttcttat 


tccgagttta 


ctgaactcct 


attactgcfcg 


atit'tag't'tga 


ttgcattgaa 


50701 


agttctaggc 


atgtaatcat 


ctgtaaatag 


taattttatg 


ttttcctttc 


caacatttat 


50761 


ttttcctatc 


agtgcctagg 


actttaatgt 


atataactta 


aaaaaaacat 


ctcttagagt 


50821 


tgtagctaca 


tatacaggaa 


atctaacaaa 


tgtgtagcat 




acaaaggcag 


50881 


acacccttgc 


agccaccaac 


aaggtcaaga 


aacaattttg 


ctgcctgtcc 




50941 


tccttatggt 


cctatccaga 


cacacacttc 


tggcttccct 


caagcagtga 


ctattatcct 


51001 


gactctcacg 


catttaaaga 


taattgaagt 


tcctcgtcca 






51061 


caggccattt 


agtgtgtaga 




gtctggggtt 


tgctagttgg 


aaattcatgg 


51121 






tatttcctgc 


aaattggaag 


ctgctgtgta 


attcctagat 


51181 


ctattaattc 


attagtgggt 








tttttcattt 


51241 


attaatttga 






gctccatata 


gcagaatact 


ttcccctttt 


51301 


aagttttcaa 


gataaaatga 


gttcatatta 


atatgtccaa 


ttcaaagctc 


atagggttat 


51361 


tttaccaatt 


gtatatccaa 




aagcttatag 


ggtttttatt 


taaccaattc 


51421 


tgtattacac 


tcttccttct 


acactgagaa 


tttttaattc 


1:taaagacat 


aggggatgag 


51481 


gaattagaat: 


g'tccca'taat: 


tactcattta 


ctttacgtat 


ttactttatc 


attactttat 


51541 


ctgttatatg 


tgcaattttt 


tttgagacac 


ggtctctgtt 


gcccaggctg 


gagtgcag'tg 


51601 


gcacaatcat 


ggtttgctgc 


agtctcaaac 


tcctgggctc 


aagtgatcca 


cctcagcctc 


51661 


tggagtagct 


agccaggact 


acgggtgcac 




ccagctaatt 


ttttttaatt 


51721 


tttgtagaga 


caaagtcact 


caactgttgc 


ccaggctgtt 




taggctcaag 


51781 


ccatcctccc 




taacattctt 


aagagtaaca 


aaaacactat 




51841 


attgccaaaa 


acacttggaa 


cttttttttt 


ttttgctgtg 




ttgccaaagg 


51901 


atattcagta 


tctgaaataa 


aaaggcaaag 


ctgaatatgc 


tgctctctac 


agcagaggga 


51961 


gctgctgtgg 


ctggacagta 


tctgaaccaa 


gcagatctta 


aaactttgta 


ggtgttgaga 


52021 


aatggtggat 


gcatggactg 


gcaccgtctg 


tggagccatg 


attatgtagg 


tgagacttgc 


52081 


tcattatctt 


gtagtgtttt 


aaaatgtctt 


cacatttcta 




gcttaatgca 


52141 


ttttttaatt 


taaatttttt 


atgttgtaca 


gtttatttta 


aatatagttg 


ctatttttta 


52201 


acacagatgc 


caagtcggtg 


ctgtgagatt 


ttctttctgg 


tgatttggac 


cagtttgtct 


52261 


ccctcttgat 


atatccatcc 


caaatggaaa 


ggccctgtaa 


actgttacga 




52321 


aggttaactg 


gaatatacac 


caatgacagc 


ttgcctgggt 


atgccaaaat 


acctgcaaga 


52381 


atgtccacat 


catctggtga 


tgtccccaaa 


taacagtttt 


taccatagaa 


agatcgttaa 


52441 


catgtttgct 


ttaaaagtca 


ttagcagtca 


taacgtactt 


acagattctg 


cctataagga 


52501 


ataatacata 


attttagata 


taaaggcccc 


actagtccag 


gtttccttat 


gccactgtgc 


52561 


ttcctactaa 


gtgttgcgac 


cagctcttgt 


cactagttga 


tgacaactta 


ctccagtagc 


52621 


cacagggctg 


tgacaccata 


gttataggtg 


attttcatag 


atttagccat 


cccaggttcg 


52681 


aaactagtat 


cctctagctc 


ttaagtagct 


gataacctcc 


ccatgggaga 


aactccatac 


52741 


tgcagtttcc 


catatggtgc 


tatgtataac 


tatcttatac 


aattaataca 


aattgcatat 


52801 


gtatacttat 


ataatatgga 


ctactaggac 


agaacttttt 


aaattacaaa 


taaaatagcc 


52861 


aagtagacaa 


ttacattagc 


aagtgatgtt 


acctatgaaa 


acgtgaggat 


ttatggtggc 


52921 


aatgcatttc 


agttaacagg 


gatgtgttag 


gggacaatgt 


gagccaatgt 


agattaagga 
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52981 aataagcctg agaaatttat cagaattagc cgtcagtatt caagcactga tcaacagcaa 

53041 tgtgtcttaa gggcaggcat cactggtgct gagagaactg ggaattgtca actgtgagct 

53101 gctaggggat ggaagaaacc ttagtgtagt cttaggagcc gcttgcttaa acagatgtat 

53161 cagaaacata ataggccaag ggtcagccct ttgaaaactg acttcagggc cttcctttcc 

53221 tcaggctgct gcctcctagg ccagaccctt attttggctt acattccata acccttgtat 

53281 gtgcgatagg gaacctgtat acaatgctqa cacggaaagg gaagaccatc gccttttgcc 

53341 tttcagtgtc tcatctgtaa gcagggccgc Yggctgacca agatcagttc tgaaggtcca 

534 01 gcctctttaa attccagttc tgtgatcaca aagccactgt tgttcctcat cctgccaact 

534 61 gtgatactgc tgcttcagaa ttactgggtt tcctgttcat catactcacc aacctgaggt 

53521 ttggtatttc tcaaatattc tgggcttcca gtacatacta gagcctgtga taatcagcta 

53581 atgatcagac aagtttgttg ggagttttac ctaagtattt ttgtgtttta aaaacctagg 

53641 gtgggaaatg ctcagagtga gatggtttga cttcattagg catataaccc atttttatta 

53701 taaaaagaaa tgcacatata agtaaaaaga ccattttagt tagtcccacc attcgttggt 

53761 aaccagttca ctaagtgtac atctttgcag atttctgtgc atacatacaa atatttttac 

53821 aagaatagga tcataccatg aataccacct ataatggtat gtctgttgtc aatataggtg 

53881 ttataattaa gactgtgtag ccttcctctg Kggatgtacc aaaatttatt taattccctg 

53941 tcactggaca cttttgtttc actaataagt agacactgtg taagcaatcY gtcaacatct 

54001 ctgcacctct atttttggta taagtatttc cttaggataa aatcccagaa atggaattgc 

54061 aaggtataaa agattattaa catttttcaa ggctttaaga tgcctttacg gtgtatctgt 

54121 tacatcctgc ttccacacaa attcttctgt atagccagta cccaagctgc agctctcagc 

54181 acaggtgaag acagccagga tgccccagtc aatgctcttg cccagtctgt cagccttcag 

54241 gtagtttagg agctgaggca tgacctgaga agagggtgac acacagttag aaagctgctt 

54301 cacagcaggg agcacgagac cttctcagcc aggatgatta tagggatctt ggtctttcaa 

54361 tcctcatact acaaagcagg attatagaca ttatacaatt aacatgttta acaatctaaa 

54421 acttccttat gacttcaaag cccctctcac cttctgtttg gtcttttcca tttgagaaag 

54481 aagttcacaa gtggctgtta atgaattatt ttcattacta atatgccact caaaagggct 

54541 gaggcttcta tttgggcaac ttttactttg tatcattgca gatgttgtta ctcttgactc 

54601 aagaaacact aattactagt aatgaataca gaaaggacat ctatcaatgt agttatagag 

54661 accagagagg aatcttagaa gtaqtctaac tcaaagagtg aataggcaga atagccacct 

54721 gatatggaat cactttatac aaatcctgtc acctcaattt ggacattgag agctttggca 

54781 ctaagaacca agcagagttt tgtgtatggt cctcataatt ccttttttac ccaaagaaac 

54841 aaaccaatat tagctatgac tttggtaagg ttagtgaatc catagctcaa gagcatttcc 

54901 accctaccca aatggatttt gatgctaaca aatccttttg ggcagggaag gacatttatc 

54961 tttaatgctt atatccattt tttctaacaa atccacaaac caagattaaa cagtaaagac 

55021 tcctctcata aagtatatag tcaaagactt taattactag aacaagaaag gaaggtatac 

55081 attatttaaa ataacaaaag ttaacagagg cactaataat aatgacataa ccacactgga 

55141 ggtggagagc aStgtagata tcctcattgt cacagaagtc agtcaataga ccgtgtctga 

55201 aaactaggaa acagaaaaaa acaagacagt tccttccagg gaactagccc caaggtgagg 

55261 caggaaactg atgattttca ttatagggta cccttccata ctgccatgtt gacccatgtg 

55321 cacaaattac cttggtgaag tttttaatgt ttaaaaacaa tcatggtgat tacacactaa 

55381 atggtcctta tttaaggtca tacctggaat tccaatattc tcttggcacc acaggggcaa 

554 41 tctggaatat ccttttcttg aggaatattt tcaccagaaa tccagatggg ggcaatacct 

55501 ctgccatatc taagaatcta aaatcaatga agatcatgtt caaataatca ataccttacc 

55561 tataagttgc caatggtaac atgctatcta ctccatgaat gttcctactc ttgatgtagc 

55621 actgacccaa aaggcatgtc acagttcccc catcagacct ggctgtRcca gtgtgccact 

55681 aatgccttct caatcacctc aaagtgatta tttcagttta tctgactcag agggcatcaa 

55741 aatatatctc ccagatgatg cttttactac ctaatgttgg caacttaatc ctatgaatat 

55801 attgtgaagg gactaagaat gagcctctgc tctaattgca gaattctgcc cagagtctgt 

55861 gcctaccttc atagttaaaa aattttagga gggacaaata ccaagtgaaa catagtgttt 

55921 tgaaaactac tacaaacata agtaaatttc actgtaataa gcttcctaca gcaactgagt 

55981 ggttttctgt attttgtcta aaagcatatg cattgctaaa aactgcctta gtgtttaaga 

56041 cctagatcta ttcttcctgt gtatttattt gaaccagtga ctggtttatg ggagtttagt 

56101 tttctttcgt gatttacgtt tatggtaggg gaggttaagg agaaaaatgt taacatgtca 

56161 cattttacaa gccaaagtta cctgttggaa atgggcaaaa ataacctttt ttctttctgg 

56221 cgggggggcc aatggtgcct aaacctcatg taccttaggc aacatctcat tcatctccca 

56281 tccctgatgc ttgctttaga aaatgaaccc tgtatgataa acagtataac ctttagtctt 

56341 ttagtaacta ttaaatggat cagcactgca aaacaccttt ctacatggcc catctgtgtg 

56401 aggaactcct ctaacaagat aacaaaagcc tgcttttata ggctcctaag gaacagacta 

56461 atgttactat gaagttattt cttacagatt atactcataa aacatggcct gaagagaaca 

56521 cgatgaggag ctatgagctc cactttacct gttctggttc aagggctatc tgagttttaa 

56581 acttctgaaa aattttatct tccctggatt catgttttgc catggaatcc agttcttcct 

56641 caagtgcttc acctgaaaaa tcaacgtaac tattatgaaa aacaggagta atccccacaa 

56701 cttgacaatt cacacatgga gaggggaccc acttttaatc agatagcttt ccctatttat 

56761 tcactcattc aagttggacc atctgaattt ccaggtactc catccaactc tattatatgg 
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56821 


acttccattt 


agtgcatctc 


cttaaagctt 


caaaataaca 


gaatggtcaa 


gggcttagga 


56881 


ctgcccagca 


catcacagga 


cacccaacaa 


atgtgagccc 


ttatcattag 


tatcctcagc 


56941 


tggtaggctc 


actcactcag 


tcatcaagtg 


ttcatttctg 


gcctggagca 


gtggctcacg 


57001 




cagtactttg 


ggaggccgag 


gcgggcagat 


cacctgaggt 


caggagttca 


57061 


agaccagcct 


ggccaacatg 


gtgaaatccc 


gtatctacta 


aaaatataaa 


aattagccag 


57121 


acgtggtggt 


aggtgcctgt 


aatcccagct 


actcgggagg 


ctgaggcagg 


agagtcactt 


57181 


gaacctggga 


ggcagaggct 


gcagtgaacc 


gagatggtga 


cattgcactc 


cagcctgggc 


57241 


gacagagtga 


gactccgtct 




aatgttcatt 


tccttctcca 


cattccttcc 


57301 


tgggattaca 


gccaccctaa 


gccactgctg 


tccccaacag 


acccgtgtct 


ctaagtataa 


57361 


ccattagtct 


ttgtaatgta 


cgttaaaata 


gaactgatat 


accttgggtc 


agagaagcta 


57421 


aaataactgc 


tttgatgaaa 


ctggaaaggc 


actgatggtg 


ttcacttgca 


ccatcaggtc 


57481 


tgatggagga 


agtgtaggat 


gccttcagat 


tgatgttcca 


tcaagtatac 


gtggaaagtt 


57541 


tcagtataac 


cgttaggaca 


ctgtaaatgc 


tgttccctca 


ggccctactg 


cctcctgcca 


57601 


agtctcaggt 


aagacacagc 










57661 


tttagaaaat 




cagaaaagtg 


gaaatactat 


aatgaaccgc 


cacatatcat 


57721 


taatcagttt 


caacactatc 


aagtccagtg 


tttcctttct 


ctgccacttc 




57781 


actctgaagt 




catatcactt 


cattcataat 




acccctcgaa 


57841 


ggcaaactct 












57901 


aagatgatta 


ctgagttttt 


ttaaagtatc 


attttgaacc 


cattaaacat 


atctgatgca 


57961 


gttagcgtcc 


ttgtggatgt 






aagcaggagc 




58021 


tgcttgagtt 


ctgacatggc 


cctagtaatc 


cttatcctta 


atctttgata 


tgaccatgtt 


58081 


cccacattat 


atgaacgttt 


cctgacctag 








58141 


tggagttcct 


tttagagaga 


aatggtacgt 


agacacaatc 


aacattatct 


tcctcctacg 


58201 


cccaacatct 


cagttctcag 


taacaccaac 


ataattactc 


gtttcctttc 




58261 


acacacaacc 


atctcaaaat 


aaaagcaaca 


gtctagtaat 


aacatgttta 


ttcaaaatac 


58321 


taacactgtt 


acattctttt 


cattctcagg 


gcatattcta 


ctagagatgt 




58381 


gttttgaagt 


cacctggaag 


agttcttagt 


gtggttatat 


gactacatca 


agagtttttt 


58441 


acttttgatg 


attagggact 


gctttttaaa 


acttacttta 


ctccataatc 


ttaaaatact 


58501 


catacagttc 


cacagtcaca 






ttgaagtcca 


gctttcatcc 


58561 


ttgatcctgc 




tctcctttct 


cctaaagata 






58621 




ataggcatga 


acacacgcgc 


ggccccttcc 


aggcagtctt 


cagtgatgtc 


58681 


acgtgttccc 


atggcacctg 


tattgtactc 


ttatcagtca 






58741 


ccagatatta 




tccataagac 


tgtgagcatc 


tgaccactgg 


agtgttgctt 


58801 


cccattatat 


ccctgttatc 


aagcacaagg 


tcaggcacag 


agtaagactc 




58861 




tgactggtat 




ccagtaagct 


gatgttttca 


ttttgagtct 


58921 


ataaatctaa 


ttttgtggtg 


gttttgtgta 


tKgctcaagg 


ctcaaattgt 




58981 


attatgtgac 




tatacccaga 






aaaatqqqqc 


59041 


agtttctcac 


tcattggtct 


gctgtcacga 


ttttaatgag 


ctcatgcaca 


aacagccctt 


59101 


tatataaggt 


aagtgctgga 


taaatgttgg 


ctactataat 


aaaataagcc 


tctaagatac 


59161 


ttggtcagca 


caagtactac 


ccaagagtat 


gcactgtaag 


taaactgaca 


aaattgtgta 


59221 


tctaaaactg 


gccagatgaa 


agagaaactt 


ttaaggggcc 


cttctgcgtg 


cccgacactg 


59281 


tgctaggcac 


tcacactatc 


ccgacccgag 


aaacMgatct 


gcgacccaga 


ggaactliacc 


59341 


aagcctccag 


catcttgtgc 


agccctactc 


atgggaccat 


ctggataccc 


acccttgtct 


59401 


ttacagggag 


cagaacacac 


ctcttatgtg 




aagtccagga 


agtatatttt 


59461 


tacctgaggc 


aatatctgaa 




tacagcctcc 


aaagtgagtc 


ttcctctcag 


59521 


tacctctctt 


ctaggcacat 


ggagcccttt 


cttccaagta 


ttatgtttaa 


ccacttaatg 


59581 


aatgaagtcc 


tgaaactgct 


tacccatgct 






cttccttttc 


59641 


cacaacctca 




catcttctgt 


ttctattaca 


atttcaaatt 


ctggaaaaag 


59701 


gaagttgtgg 


tctggaatta 


tatggtccag 




acaaaaagga 




59761 


agtaatcatt 


tagttttgaa 


gacagtctaa 


taatttgctg 


tctctaaagt 


actatattcc 


59821 


ctatagttct 




ataaagggtc 


ataaattaaa 


tgcctatatg 


gtgacattat 


59881 


tcagtgattc 


agacttcaca 


gccttttttt 


tttttttaca 


aaggtgttcc 


aggcatgaaa 


59941 


aattttaaag 


tactatacct 


ttcctaattt 


tacctttaaa 


gttgtcctgg 




60001 


gttgacaaag 


gcgatgaaac 


tgaactgaRa 


cttaaaaaaa 


agattaccca 


cctggttgtg 


60061 


cacaagcctg 


cttatgtccc 


aatctccagt 


ctagggtctg 


atgctccttg 


ctgcagtaat 


60121 


atgctttgtg 


gcatctggag 


cacgttttgg 


ggcctaaaca 


gccacaaacc 


ctgcagagat 


60181 


gagcaccaga 


cttaagctgg 


agacacactg 


attctcctgt 


ttctggggga 


ggattctcag 


60241 


aaggtggctc 


atatgagtaa 


aaatcgtttt 


tcctgggtag 


ttgattccta 


aaaactaaaa 


60301 


aagaatacag 


agaaaagttt 




caaaacagca 




tttatcctct 


60361 


gcacgtaaaa 


ctgaaaataa 


caacaacaaa 


aaagaaatga 


aagtttttgc 


tttcaggaat 


60421 


aagcttttaa 


aatccagaaa 


ctagatttcg 


tccggtacac 


gcaactgagt 


tgcctcctag 


60481 


aggtggtttg 


agttaatcaa 


attaataaga 


ctgatcgtta 


agaacgactg 


ccaaaaatac 


60541 


gaaaaagcta 


ctgggatcca 


tctttccaag 


acaatttcta 


ttatctgaat 


taacaccata 


60601 


cctggtaccc 


actgattaaa 


agctgggggt 


taccaatgcg 


cgtgggcaca 


gttagaagct 
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60661 tatgtagcaa aaatgagcac atcctggaag ggcccgggag aaggtgctcc tggggcagcg 

60721 cggagaggga gctctgaggc tggggcggca gcggtgcttg ccgccgtccc cctggtcgct 

60781 cccggaatta acgccgcgca cgcgtcggag gcatggcccc gtcccgaccc cgtttggcgg 

60841 ctcacctcgc aggccggcac agcacggctg ctcgcggcag cagaagagga agatgcagcg 

60901 gtggaaggcg tccgggcggc caggcagcgg cgcatacacc tgcagcagga aggagagcgg 

60 961 gcggccgcac agctcgcagg ccagggcctg gggccccggc agcccggccg cgcccagcca 

61021 tgccggccgc ccgcccacct tgctggggaa ctgctcgctg cgcagtcgcc acgccggcgc 

61081 cgactcggcg aagcccagct ccacaggcct ggccccggcg gcagccatgc ggggcgcggg 

61141 ctggcgtggg gcgcagccca cagctgggtc ggaaggcgga aatcgggcgc cgggccggaa 

61201 ggcaagaggc gggcaccttt ccggaggaca ggaggcggaa acgcgtctga cgggagcggt 

61261 tgcaggacca atgcgaggga acggggcaga ggaaacctct cggcatcagc cccgcccctg 

61321 gcgcctctgc ctccgagccg ctttcctggt gcctccgggt gctctgggat ggttctggtc 

61381 tttgggagag tggcagctgg tgacggcgct ccgctcacct ctgcacatgt cttgctgtgg 

61441 gcctgcgggt ggccgccagg gaggcagagc cctcccKcaa accttccctg ctggtgtcca 

61501 cctcagggtg tgggaaacct gtgcgctggc cgagtgctaa ccaagagtag gcagtgaaag 

61561 acaaatgaag gttgaacagg taaagtgagg accctacagc ggaaaccaag aatcctgtgt 

61621 gcctgagagt aatgaagaag cctctgcaga agagtctttt ctgtcagtct taaggtctct 

61681 gttttaatgt tagtgctggc ttgctgtacc tgaattccaa gggaggagtg tataatgagg 

61741 catggccaac ccccacttcc catcattgcc tgaactagtt tttcaggtta acttcagaat 

61801 gcccttgggc aagcagaggg tccatcagtc ggttggaggg tttagaattt tactgttggt 

61861 ttgcaaaggt ctgaaagaaa catgtaccac ctgttctctt taaggagttc tacttaggag 

61921 gtttcattta cataacaaga ccgtggttgt cagccaggtc tccacccgca taacctgtta 

61981 tgccacaatc caaaccccca ttctgtaacc tcaagatggt atataagttt ctgaacccca 

62041 tttggggctt cagcaaaatc actctggttc tcccccatgt gcatgttaat aaatttgtat 

62101 gccctttctc caattaatgt gccttttgtc agttgacttt tcagtgaacc ttcagaggac 

62161 aaaaaggaag ctttcccttg gctactacag tggctttaat ggaagtaaag tcatcaacaa 

62221 catttatttt tgacaaaatc acagtttagt tggcagtata tttgtttgtt tttgttttat 

62281 ttgagatgga gtctcgctct gtcgcccagg ctggagagca atggtgcgat ctcggctcac 

62341 tgcaacctcc gcctcccgag ttcaagcatt ctcctgcctc agcctcccga gcagctgggt 

62401 agcagtatat ttgtaatggt acataaaata atgtctgttt ttaaataaac atttacattg 

62461 taaaccaaaa gttaactctc catcaatttt tttttttctt ttttgagatg gcatcttgct 

62521 ctgtcaccca ggctggagga gtgcagtggc atgatctcgg ctcactgcaa ccttcgcctc 

62581 ccaqqcaagc gatcctcttg cctcagcctc ctgagcagct ggaattatag gtgtgtgcca 

62641 atacaaccag ctaattttcg tatctttggt agagacagag tttcaccatg ttggccaggc 

62701 ttgtctcgaa ttcctgacct cagttgatct gtccgtctca gcttcccaaa gtgctgggat 

62761 tacaggcatg agccaccaca tccggcatcc atcaatttag aaagtttatt tcaccaagat 

62821 taaggttgca cccgtgacac agcctcagaa ggccctgatg accatgtgcc cttggtggtc 

62881 agggtacagc ttgcttttat acatgttagg gagacatgag acatcaatca gtatgtgtaa 

62941 gatgtacttt agtcaggtaa agcgggactt gaggtgaggg cttccagRtc atgagtagat 

63001 aagaggcaaa agatcgcatt tttttgagtc cttgatcagc cttccactga atacacaatt 

63061 tagtctggct cagtgaatta tcatttttaS gtaaacaata ggggagggga agcaattaga 

63121 tatgcatttg tctcaggcgc accttaaagg gataactttg agttctgtct gtcctttatc 

63181 cacaaggaat ttccttgtgg gcaaatttta agggaggtac gtagcctctt atcttggcag 

63241 ctatcttatt taggaataga atgggaggca ggtttgcctg acatagtttc cagcttgact 

63301 ttaccctttg gtttagtgat tttgtggtcc tgagttttat tttcctttca cagaaattat 

63361 accgtaaaag taattgaaga aaatcacttc tttccccttc cctcaactag gccttgacca 

63421 ttttaaataa aatcaggatt tgctgaaggt caacaaattt aaccaagttc agttaaaact 

63481 taactctgaa tctgtatgtc cctggggtct tttccagtga gagatgtcta agcatcattc 

63541 caagcttttc tatactaact ggcctatttg tatgtttctt attttaggat tccttttgtt 

63601 catgtgtatt tttattaggc aatcacccat ttcctgtagg tttccaggtt aatatatttt 

63661 tcttattgga attttaattt atcctctcta gttttctaca tatttttttt ttctttgaga 

63721 cgaggtctca tatgttgccc cggttggcct caaactcctg ggcttaagca atcctcccac 

63781 cttggcctcc caaagtgctg ggattatagg tgtgaattac tgtacccagc ctagaatcct 

63841 tagtcctata tacttttgct gtttttttca ttgccaactt gaaataataa aaagggtcag 

63 901 aatcctataa accaaaaata aaataaggcc ccccaaccct ctgaatggac ttcctcctct 
63961 gacacagatc ttttacaatt taacctgtat gaaccccaaa aattggagac aggtctcagt 
64021 Kaatttagaa aatttatctt gcgaaggtca aggacacacg attatgacag cctcaggagg 

64 081 tcctgacgac atgtacctaa gatagtcaga gcacagggtg gttttataca ttttagggag 
64141 atataagaca tcaatcaaca tatgtaaaat gaatattggt caggaaaggt gggacaactc 
64 201 aaagaaaaaa tgggacaact cgaagtgggg aggggcttcc aggtcacagg taggtgagag 
64261 acaaatggtt gcactctttt gagtttctga ttcacctttc taaaagaggc agtcagacat 
64321 gcatttatct cagtgagcag agggatgact gaatggaatg ggaagcaggt ttgccctaag 
64381 cagttcccag cttgactttt cccattagct tagtgattat gggattccaa ggtaatttcc 
64441 tttcacattt cccccctttt cttttttaaa atattttgga gaaagcattt ttgaagaaaa 
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64501 taagtttctg ttcccaggtt ttatctgctc tctcatggct aggatggttt tttcctagaa 

64561 gggtaggtcc tgagttatta ggaaagctca tttttagaag gttgtgaagt ctaatatcct 

64621 atcaagagaa aatttgggga ggaagggaga acaataagaa caatcttgga aaattgatct 

64681 aggccacatt actctgaagt ccatacatca gtaagcaggt atgaaagtgg cttatgtatg 

64741 taaataggtt cccattattt tcttctgaag tttaagttgt ctacttcagt tcacagggct 

64801 tcacgaaagt tagttttaag tgacttagtt tagtgacagc ttagttttaa gtgactccaa 

64861 attaggaaaa atggggaaaa aaagaaggaa aaaaattgaa aacattattt tgaagacttg 

64921 tagcccacaa aaattagaat ttggtccaaa ctgtagaaaa tgataaaaat tgaaaaacat 

64 981 taggcaagac tagaatctaa caactggtgg actatagttt tccagtctct agtttcccat 

65041 ttatactaaa gacaaatcat gataggtttg ccttattata tttggccgaa ttatttgtat 

65101 acagtgcagc aagaataatt attttttaac attggctttt aaattggctt tgatggaact 

65161 ttgttccata gagggtattt cagataagac tttttaaaag ctgagcccag ccatggattt 

65221 gtgccatcaa atacctgtga gtttggtgat cctctcctct tgaggttcca agataaactt 

65281 gaggctcctg ggcctgtcag aaagtgacat tctttactta ccacaggtca ggaaccctgt 

65341 acaggagctg tgtagacaaa gttatgagga cggtttttcc aagggggttt aattggctcc 

65401 gtaagtcaag taaaagcatt ggtaaacaac cagtttcccc aattgtgtcc tgttacaaat 

65461 gaaaacagat tattagtgca cttatgcaaa taactgtatt gtcataagtt aagaatactt 

65521 acagtttcca aattctggag aaatcgggta gagagaacca aatgtgctcc aaattttgtt 

65581 cataggagta tgtgttactc aattgttaaa agctgcagat agcctgacca acatggtgaa 

65641 accctgtctc tactaaaaat acaaaaaata gccaggcatg gtggYgcgca cctgtactcc 

65701 cagctacttg ggaggctgag gcaggagaat cacttgaact tgggaggtgg aggttgcagt 

657 61 gagccgagat cgcaccgctg cactccagcc tgggcaacag agcaagactc catctccaaa 

65821 aaaaattatc ttgtttttat caatctttct taaatgtata gctcacattt atttcaatgt 

65881 ttaaaatgag aaatatttgg ggtctttatt tagacgtttg ttgatgtttt tgtgaccaga 

65941 aatatgttac aagaacttaa ctcttgttta tatcaatcat cccatggcaa attggtttta 

66001 ttaccagtag tgttgcttaa agtcacagtt tccaagaatc tatccatgat aaatgaggac 

66061 ttactggact taatagtact gaattgtaca cataaaaatg gttaagatga taaattttat 

66121 gttatgtgta ttttaccaca atttttaaaa atgaaaaaaa gaaatcctgg ggaatgatat 

66181 gttcaaagaa ggctttgaaa agctctgata tattccagaa atctagacag ctacacaggg 

66241 gcagggcttt gtgcgtggcc atgaaagacc tgaggcagct gtaatgtttt acatttgact 

66301 ctggggttct gtgccatcag aaagtgaagg ctaaagcaga cttggactgc cagagtgttg 

66361 aagatgtgtc ccagcacata caaagccact ctacaaaggc tgqgaqacat aggttcaagg 

66421 catttaagga aatctctatc taatcattag ctgaccactc agcacactaa gcaaagactg 

66481 aaagacacat gacaaagaat acagacttca tagaattagt cagggaaagt cactaaaaaa 

66541 ttcaacagca acaacaacaa aacccttgag agggaagggt ctggcttaca gttaccacat 

66601 cctagtcttt aaaatattca gttttgagca aaaatatgag atacacaaag aaacaaaagc 

66661 ccatgcagag gaaattctga gaccagctcg atcggggaga ccctaaccca gtggcactag 

66721 gggatattaa agacacacac acacaaatat agaggtgtga agtgggaaat caggggtctc 

66781 acagccttca gtgctgagag ctccaaacag agatttaccc acatatttat taacagcaag 

66841 ccagtcattg gtattgtttc tatagatatt aagttaacta aaagtatccc ttatgggaaa 

66901 cgaagggatg tgccgaatta aaggaatagg ttgggctagt taactgcaac aggagcatgt 

66961 ccttaaggca cagatcgctc atgctattgt ttgtggctta agaatgcctt taagcagttt 

67021 tccgccctgg gcggaccagg tgttccttgc cctcattccg gtaaacccag agccttccag 

67081 cctgggtgtt atggccatca ggaacatgtc acagtgctgc agagattttg tttatggcca 

67141 gttttggggc cagtttatgg ccagattttg gggggcttgt tcccaacagg aaaataaaga 

67201 agtcaatgga aattctccat gaggaggccc agatgttatc cttcctaggc aaagatttta 

67261 aattagctat tataaatatg ttcagagaac taaaggaaac catttctaaa gactaaagga 

67321 atgtataaga atatctcacc aaatagagaa tactaatgaa gagataagaa ttagaagaac 

67381 caagtaagaa atgctagagt tgaaaattac aaaactgaaa tacaaaattc gctagaggca 

67441 ctcaacagca gattagagat tgcaaaagaa agaaacagca aacttgatga taggtacatt 

67501 gagattaccc aatcagagga gcagaaagaa aaaagaataa agagaaatga acagagcctt 

67561 agaaatctgt gggataccat caaacaaatc tggaagtccc agaggagaga acaaggggga 

67621 aaaaaggcag aaaggctatt tggagaaata atagccaaaa ttttccatta agcttttcat 

67681 ttgattaaaa tttatatatt ttaaaaagtt taagtgtatc ctactcatac tttttaaaga 

67741 ggtccatacc tagacacatc atagtcaaac tgccaaaaaa gagagagaga atcctgaaag 

67801 caccaagaga aaaacaattc atcttgtata agagatcatc agtaagatta acagctggct 

67 8 61 tcttgtaaaa aactcacaga gaacaaaatg ctgcagaatg acatacttgg gaaggcagtg 

67 921 gggtggagaa gcatgccaac taagaattgt atatccagct aaactagcct tcaaaaacca 

67981 aagaagaaaa ataaaaaaaa aatgagatat tcccaaatta ataataaaca aataacactg 

68041 agataatctg tcattcatag atctgcctta tgagaaatac taaagggcct cctctttcag 

68101 gctgaaataa aagaacatta cacataactt gaatctacag aagaaaaaag agcacggtta 

68161 aaagtaacta cataggtata taaaaatgat agtataaatg tatttttgtt tgcttgtaac 

68221 tcattctctc tgatttaaag gacaactgca ggaaacaata ccaataaaaa tgttttgaca 

68281 atgttatatg aaaatttaat ttgcatgatg gtaataccac aaataagagg gatgggaata 



139 



PATENT 
Docket 524593008700 



68341 gagctatatt ggagtaaagt ttttgtgtat gatttgaatt aagttggtat taatctgaac 

68401 tagattgttt taaattaaga tgataactga aattaaatgt taattgaaag gcagtcacta 

68461 agaaaataga gtaaaagaaa caaataaata aaatggcact taaaaatatc taacagtaaa 

68521 gaaagcaata atggagagca aaatgacttg aaacatacag aaaataaaga gcaaaatgac 

68581 aggcacaaat cctaccatat cagtaattaa atgtaaatga attaaacatt ctaataaaaa 

68641 aggcagagat tggcaaaatg gattaaaaca caaaacacag acagaaacag caaccattct 

68701 ccaattatat gaggagacct gctttacatt caggaaaaca aataggttga aagaaagggg 

68761 atggaaataa atatactctg caaacagtac ccaaagacag ctgaagtggc tatataaaca 

68821 ccagacaaaa tagactttaa tgcacgaatt attagagaca aagacatttt ataatcaaag 

68881 ggtcaatgta tcaggaagac ataacaatta taaacatata tacctctgac agtagagcac 

68941 caaagtaaat gaaacaaaaa ctaacttaaa aagagaaata cacaggtcaa caataaatgt 

69001 tggagacttt aatatcccac tttcaataat gaacagatct aggcagaaga tcaacaaaaa 

69061 atagaagact taacactatg aaccaaatag acctagcaga catctgtata atactgcatc 

^9121 caacaactgc acaatatata ttcttcaaat acacatgaaa gattctctag gatagaccat 

69181 gtgttaggcc ctaaaacaag tctcaataag tttaaaagga ctgaagtcat acaaagtatg 

69241 tcttccaacc acaataaaat gaaattagaa atccataaaa gaagaaaatg tgagaaattc 

69301 acaaataagt agtaatgaaa caaaataacc tatgggataa aaaggaaatc acaggggaaa 

69361 ttagaaaata tgtagagatt aatgaaaata caccatgtca aaatttatgg gacacaggac 

69421 agtgttgaaa aggaaattta cagctgtaaa cacctatatt taaaaagaaa gattttagtt 

69481 caataatcta aacttctacc attagaaatg aaaaaggaaa tgcaaacaat ctaaatcaag 

69541 cagaaagtag aaaatagcag agattagggt ggaaataaat ggaagaccca aaaaacagag 

69601 aaagtaataa aatcaaaagt tggttattgc cgaaaccagc ttggctgggg agaccctaac 

69661 ccagtggcac tagaggaatt aaagacactc acacagaatt atagaggtgt ggagtgagaa 

69721 atcaggggtc tcacagcctt cagagctgaa agcctcaaac agatttaccc acatatttat 

69781 tgacagcaag ccagtgataa gcagtgtgtc tatagattat agattaacta aaagtattcc 

69841 ttatgggaaa caaagggatg ggccgaaata aagggatggg tctggctagt tatctgcagc 

69901 aggagcatgt ccttaaggga cagatcgctc atgctatttt ttgtggttta agaacacctt 

69961 taagtggttt tccgccctgg gtggtacagg tgttccttgc cctcattccg ataaacccac 

70021 aatcttccag catgggtgtc atgaccatca caaacatgtc acagtgctgc agagattttg 

70081 tttatggcca gttttgcggc caatttatgg ccagattttg ggggcctagt cccaacgtgt 

70141 cccccttctt tgatttgcaa agtgataaaa gcaaaggcag ttttgtcacg gtgagctact 

70201 tcttgcagga gtcaggatcY gcatctgcag actatacaaa aacaacatag attaaaagca 

70261 caatcatcat cgaaatcaca gagcttccaa gtgttttcat ccattttaat gggttgctag 

70321 ctKcttatct gtctgcagct cctttaagca ctccttttcc tggcattaag gtcaggtgtg 

70381 cctaggatgc tttatttgtt cttttaattt tgcaatatcc aaaaacaagc ttgtagagtg 

70441 tccttctaga tgctttttaa attccttccc aaattttgat ctaattaaga gctattaata 

70501 atttccacaa atccttattt aagctcctag agtgggccat atcatttgag gttgaggtgc 

70561 cactataccg ccatggttcc agatgatagg aactcttgcc atgtcttatc atttctacca 

70621 tctgaccatt ttgttcagac cagctgaaca tagtgtggct gtggcatgca gactgagagg 

70681 tgcaattcaa gccaaacatc cccttagggg accaatcaat aatgattcca taggaagcat 

70741 tgtgcagcac ctctgcctgt tctgcaatgc agtcttccta aacaagtaca ttcatttttt 

70801 ctaactgggt ccaatcctgt ttacaaatag gtttttgagg gcagtatgcc ttaattatag 

70861 gagcagattt attatggtaa atactgagat cagaaagcat gtgtaagtgt gtcatagagt 

70921 gattacatcc aggcattatt gccagccaag attgataaMt atgcccaata agtacagttg 

70981 ttctctgtgt cagcacttgt tgaaggaata ctcatggcaa tggtgatcac cgctgtcata 

71041 gctaccatta aattactcac tgtgactggt tgtcctgctt tcctcaggtt ttcttctgcc 

71101 aactgtgaca gcttcttgat ctgtccccaa gtgggtggct gtgtttgatg ggtgttgctc 

71161 gtgacagttg gggtcctcct cagcatcagt cttgagatgg ctgcaaccag ggggtcctca 

71221 ggatcctcct ggaatctctt cctcagcatc tggctcatga taaggtttca ggtatcttga 

71281 tggtatccaa atcagctctt gattttggcc tggagaaaca caagcataac ctctacccca 

71341 agttattatt ttacctattt cccaaccttt tgttatgcaa agtgagaaaa ggtgtgcaca 

71401 catacatgta acgtgtgaca tccatttgcc aaagataatt aggttccaat cctcgaggat 

714 61 taactcctcc tgcaaaagat gaggaatgca ccatttggca agttgggcat cactggatca 

71521 taactttagc ttctttccag gtaatgctgt atctgtgttt cagaccagag gcattaacat 

71581 gggttaaatt gtgaaaatgt ctagcagtag atattgcttt agcaactagg tgatcagcca 

71641 tttgattccc ttcagtcaaa ggtcctggaa gaggtgtatg agccctaatg tgagtgatgt 

71701 aaaaagggtg cattctactc ctaactgctg tttgcaactg ggtaaataaa gtcattagtg 

71761 tcatctgttt gaaattgtaa ctgagcattt tcaactaact gtgtggaatg aaccacgtat 

71821 gaagaatcag aaatcatatt aatagacata ttaaaagcag tcaatacctc aattacagct 

71881 acaagctctg ctgttagagc tgaagtatag ggtgtctgga aaactttaac ttttgagcca 

71941 gaataagaag ctttaccatt actagaccca tctgtgaaac aatgaaaatg cttagcaggc 

72001 tgcaagttgt ttactgcagg aattataaat gtaaaccatt cacagtcttg cttagctaaa 

72061 gtgatagtaa agaaacagtc ctttaaatct gtgacaatta aaggccaatt ttttggaatt 

72121 atagcaggag aaggcaatcc aggaggctgt aatgctccca taggttgtat aactgaattg 
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72181 atagctatta agtcagttaa cattctccat ttacccgatt ttttcttaat tatgaaaact 

72241 ggagaattcc aaggggaaaa tgctggcgct atgtgcccat tttctaattg ttcagtaact 

72301 aattcctcta aagcctccag tttctcttta cttagcggcc attgttctat ccaaattggc 

72361 ttatctgtta agcattttaa aggtgtaggt tctggaggct taacagtggc cgccatcaaa 

72421 aatgatatcc tacaccttgg cgggaacttt gtctttacgc ttgaagcggt tccttcaaac 

72481 cttgcaattt ttttcctagt cccataccag ggacatgccc catttcatgc atcatatgtt 

72541 gactttgagg gctatataat tgttctggaa ttagaacttg tgctccccat tgttgtaata 

72601 aatctcttcc ccataaattt gtaggtacag aagttataat tggttgaata gtcccaggtt 

72 661 gtccatcggg ccttcacaat gcaaaatata actactttga tatacttcag gggctttacc 

72721 aactccaact atgttaaatt gagcgggtgg aattggccac gcaaacagcc agtgctgtag 

72781 agaaatgatt gaaatgtctg ctcctgtatc taccaaacct ttaaatttct ttccctgaat 

72 841 agttatttca caggtaagat gtttatcagt aatttgattt acccaataag cttctttgcc 

72 901 ttgtttattt gtgctttcaa atcctcctgt tcgtttaatt taacttttcc gtatttctac 

72 961 atacggcaca atcaggagct gtgctataag ctctcctggc tctgctttcc agggaacaga 

73021 agtggatata acaatttgaa tttccacatt gtaatctgaa tcagtgactc ctgtttgtac 

73081 ttgtatccct tttaaattta aactagacct tactagaagt aatcctatca tccctgctgg 

73141 caatggtcca cagactcctg ttgggaactt ttgtgggggt tccccaggca gaaggctcac 

73201 agcatttgtg cagcataaat ctactgcagc agtaccagct gtggcagggg acaaacattg 

73261 taaatcccat gaagtatatg gggtattagt gcattcattt agagctgagg gctcttaggt 

73321 ggaagatgtt aataaactgt ccaggtccca tggttcagca gagcattttc cattgctggg 

73381 gcccttaggg atctccagga tttatcccag agtgcccata atggacagta caaaaStggg 

73441 aagagtttcc aagaaagcaa ataccccatc atccaccaca aaacacactc tcagcttatt 

73501 catcaatgtg gcactggggt ccttaatgtg tggcaggcag aggtaatgac agtgagcaaa 

73561 tagcccaggg attacttgag ctaaaagcaa caaaggaacc aaacgcttcg actttctgac 

73621 caaactcagt tcccaactga agctggtctc atgcaagtgt gctggcctga gtgatctcac 

73681 atcctgtgaa ggctctgcca aggagggatg gtggctaggg tcaggcctcc ttaagagggg 

73741 tcccggatcc taccagaggc taaggtggga ggatcgcttg agcccagaag tttcagagca 

73801 gcctggacaa catggagaga ccctgtctct gcagaaaaaa aattagcaga gcatggtggc 

73861 aggagtgttt aatgccagct actcaagadg cagaagtggg aggattgcct gagcacagaa 

73921 gtaccaacct gcagtgagct atgattgtac cctgacactc cagcctgggc aagagagtga 

73981 gaacctgtct caaataaaga aaggaagaaa ggaagaaagg aaggaagaaa ggaagaaagg 

74041 aaagaaggaa ggaaggaaaa ggaaaaggaa agggaagaaa aaggaaggga agcaaaggga 

74101 ggaaggaagg aaggaaggag aaagaaagac aaacagagaa aagaaagaaa agaaagaggt 

74161 gctttggaga aacccacaga agacttttac actttagttg tgttttgaat acttaaatcc 

74221 aacatacaaa ggaatagaaa ttatattctt cacaattatg atttttcttt tctttctccc 

74281 tttttttttt tttttgagac agagtcttgc ttcttcgccg aaactggagt gcattggtgc 

74341 gaccttggct cactgcaact tctgcctcct ggtttcaagt aattctccta cctcagcttc 

74401 cccagtagct gggattaaag gcatgtgcca ccacacccgg caaatttttg tatttttagt 

74461 acagacaggg tttcttcatg ttggtcaggc tggtctcgaa ctcccaacct caggtgatcc 

74521 gcctgcctcg gcctcccaaa gtgctgggat ttaaaggcat gagccaccgt gcccggccag 

74581 gtctcagatt ttttagaaag tttattttgc caagttggag gatgcgtgcc tgtgatgcat 

74 641 cctcaggagg tcctgacaac atgtgcccaa ggtggttggg gcacagcttg gttttatata 

74701 ttctagggtg acatgagaca tcaatcaata tgtgaaagat gtacgttggt tcagtccaga 

74761 aaggtgagac aacttgaaga gaaggccaaa cagggggctt ccaggtcata ggtagaaaag 

74821 agaccaatgg tttcattctt ttgagctgct gattaccctc tccaaatgag gcaatcagat 

74881 atgcatttat gagcagacag gtggctttgg atagaatggg aggcaggttt gccctcagca 

74 941 gttcccagct tgacttttcc ctttagctta gtgattttgg gtccccaaga ttgatttttc 

75001 ttttgtaagg tctaacatgg tttcctatga gcattaatta ttcattgtgt attttattac 

75061 aaaaataagg cacagatttt ttaaaaaaca tcaatttcat gactagttat atacacataa 

75121 ttacactgaa gttcaactaa atttggaaac attccagagt ttgggtttct aataattctt 

75181 tgtgattctt tagaagctaa aatattttaa caaagcaaca tctaaaatca cctgtagaat 

75241 gtcctgccat ttttgtttct ctagtttcct cattttctgc aaagcctcgc tgaggaaatt 

75301 gactctgaat atccttttac actcttctgt tttagaaagc attgtggtga aacattgaat 

75361 cataatggtc acaagttctg ttcacattct ttctttcttt gaatattttt tcccagtggc 

75421 caatatttga ttctgttgta ttatgggtaa aaggtaggca tgagaacaaa ataaagacaa 

75481 gaggtctttt gaataagtga tccagtcaca atgaatcaat ttgccattgg aacatatttt 

75541 tacgtcactc ttctgaaaat atttagccat gaattgaaag agagtctgta agattatttt 

75601 tttcctgttc taaggtgaac agcattttag agaatgaacc acaaccacag cacaagaaaa 

75661 aaatctgata aataagttta cacatatgtg ttactactgt aacataaaac atgtaaagag 

75721 catttgtttt gatttatata tcagtctgta ttgtttaatt ttttgtgtca taaatgctct 

75781 tatttaaaag acaggactat ttaacagtgt aaattactag taattcatgg tataaataat 

75841 taaacaagga agtgttcaaa aatataacaa tgttttaaat aagcccattt tgtgcttctg 

75901 taacagaata cctgaggctg ggtaatttat aagtaaaaaa cgttcatttg gttcacaata 

75961 ctggtggttg gaatgtctga gattgggcag aggcatctgg tggggcctca gtctttttca 
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7 6021 cctcatggtg gaaagtggaa ggggagcaag ggagtgcacc agagatcaca tagcaagagc 

76081 aaaagcaaca gtgaagccaa ggaagtcaga ctctttttaa ctacctactc ttgcaggaat 

76141 taatccattc ctgtgagagc agaactcact cacctctgtg gaggacaata atctattcat 

76201 gaaggatcca tcccatgacc caaacacctt ccactaggcc ccacctcccc acactgccac 

76261 attgggggtc aaatttcaac atgagatttt gcagggacaa accacatcga aaccttagta 

76321 atttgtagca tagttaaatt ctttttcaca tgatgtattc tgtgctggga tactccacat 

7 6381 cctgaatatt ttaatttaat ttgaatagag tttgatttac ccattttgct gtaaaattcc 

7 6441 gtgtgttttg acaaatgcat agttgcaggt atccattatt aaagaatcat atggaatgct 

7 6501 tcaaatcccc accccatgca gccaatggca ttcccatctg tgcagtttgc cttctccaga 

76561 atctcattaa atgaggtcac actgtgtgtg ttctcctcag actgtcttct tccactcagt 

76621 aatgtgcatg caagattcac tcatgtcttt gtgtgagttg atagcttgtt cctttctatg 

76681 gctaaatagt attccattgc atgaatgtac cacaatttgg ttatgcattt tagggagcaa 

76741 aaccttcctc ttctaacttt gttccagggt tggagacctt caaattaact gacaatagat 

76801 acattagtag gagaggcaat acttggcttc ttattccaca agtatcattg tgggacaaaa 

76861 ttcatcagat ggcaggatct agtttacaaa gaggtgaaaa tagcccaaaa cgagaaacaa 

76921 gactagaatc tgataaccca caagggctat agttttcctt tttaaaaaaa tttttttttg 

76981 agacagagtc tggctctgtc gcccaggctg gagtgcaatg gtgcaatccc agctcactgc 

77041 aacctctacc tcctgggtgg aagcgaacct ccctcctcag cctcctgatt agctgggact 

77101 acaggcacat gccatcatgc ccagctaatt tttgcagttt tagtagaggt ggggttttga 

77161 actcctggcc tccaaaagtg ctggaattat aggcttatgc caccatgcac ggctgagtta 

77221 tagttttcca ttgaaacata aaatttctct ctgtagtaac catcattttt gatcatagat 

77281 aatcaatgtg agattattct tgttttaaaa ataagtctag tttcgttaga ttttgcctga 

77341 gtatttatgt aagtgcagca agaacaggag gtgaccacgt aggtgctttc aagcttcttc 

774 01 gctggaagtt ttcatacaga atctcagatt ggacttttaa aggccttatt gaggctaaaa 

774 61 gccaagccaa gaacatactt tcaaatttca gctgcagtca ttgtagcttt atgtgaattc 

77521 ctctcttctt gacgccccca aaatatcccc aaattcctgg gcctagcagg aaattacctt 

77581 ctttactaac ctgtaaggct gtgaaccgtg taatgtaggt accaggctgg cttttctcaa 

77641 agtgctttgt aagcattggc ttcataaaag tcagccttag ttccttaaaa ttgctggtca 

77701 taactgatct caggtatact attcctaaat atgatattcc agtaaaagcc tgataatata 

777 61 atcaaagttt ccaattatgt cttgctataa ggtgaacaga tgcttattgg acttctgcta 

77821 acaactatat tgtcatgaaa ataagagtat tcagtaagat tttcaaaatt ctggagaaat 

77881 caggcaggga aaaaaagatt tttttttttt ccccacccag cctcatttct gtttacaaaa 

77941 gtataatcta ctaaattttt gtgagttata gttagcttaa gagaaagaga tttcttaaat 

78001 ccagaaacta gaacattaaa gaaccagcag tactccaaaa aagctataaa attataatca 

78061 attttcatca cttcattcag tgccatgtaa tcaattccag tcttgctgga tcttgggtta 

78121 gcagtgtcac gaacccatcg atttctcaac cagacttctg gagatcttca ctgagtcaag 

78181 tgtatggtct taaagttatt taagcaatat catcagaagc ctacaaccag agtacctgtc 

78241 ataggctttt tcgtgagtct cagagcgagt cctgtgttgg agacaaacat tctgacctgt 

78301 agctgattgc aggagctttc aggaaagtat cagggggaaa tagtatctaa atgccaaaga 

78361 gtatgaaatg gctgtgatga aaggtcagat gagagttcat tataccacaa ctgacaagga 

78421 tattcgattt ttttggtggc atacaacgtt taaaataata attgaaatta tgactcaaaa 

78481 cagtataccg gcacatagca tggataagga ggacattgac aaatttccag taattttata 

78541 caatttctga aaacataaca ttttatccat acaaatataa cccagggaag gttaggtatc 

78601 tctttttatt ttatattttg tatggttttt cttataaaaa atacatccta ctttacctgc 

78661 aaaacatgcc ctgctttttg catacttttc atagagttgc ttctagtttt taatacttag 

78721 taatctctat tttccagaga aactaggaag aagacaattt taaactgtca tacattagca 

78781 ttctatagta gattagaaaa tgtatgagta taccatctcc caacatctag agggatgtgt 

78841 ttcctcattg tacaatttct cagtgtggta gacaaaaata cgtttattaa cgggccaaaa 

78901 tatctttact ctctctgtaa aaacaagaag ccaaaagtat ataaacttga attacttatg 

78961 ttcagtaatt aatgttttag tatcgtatct tatttaaaat gatctagata ttgaatgcaa 

79021 atcttttact tagcttaact ttaaggttaa aaattaccaa aagtattttg gaaactatta 

79081 ttaggcagat ttactgtaaa aaattattat tgaaataatg cttttaataa gaatgacaat 

79141 tagaatcaaa tctataagct ttaagatttt aaggatctag taagtataat attagcttat 

7 9201 ttgagtagaa ctcaagcaga atagaaattt gttttatatt taatagtgat aactctgaag 

7 9261 acatagttgt tttattacac caaaaatata aaattactct tatttaacta agttttatcg 

79321 aaattgtgtt aacttgaaaa acatttggat cagttcctat atttattgga gtttggggaa 

79381 tatttattta taaatgcttg gttttttttc caagccaagt tagaatagag cacttttaga 

79441 agattttata agtgaatttt gcaatgctct ctggagtgaa gaaaaatcac atatacataa 

79501 catacattaa tatacataca aacacaaata gaggtctcat agctttcatc ctgaaatatt 

79561 agccatgaat caggcataaa tattctgatg gttaatttta gacatctgct tgattgatta 

79621 agagatacta acatagctgg aaaagcacaa tttctgggca caagtgtgag ggtgtttctg 

79681 caagacactg agataaggaa gatccaccct gacccaatgt agataggcac tgatatggtt 

79741 tggctgtgtc cccacccaaa tctcatcttg aattgtagtt cctataatcc ctacatgttg 

7 9801 agggatggac cctgtggtag gtgattgaat catggtggtg gttactgcca tgctgttctc 
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79861 atgatagtga gtgagctctg atgctctgat ggttctacaa ggggcttttc cccttttgct 

79921 cagcacttct ccttacagct gccatgtgaa gaaggactct ttgcttcctc ttttgccatg 

79981 attgtgaggc ccctccagcc atatggaact gccagcccat taaacctctt tgttcttttt 

80041 tttttttttt tttttttttt ttttttttga gatggagtct ggctctgtct cccaggctgg 

80101 agtgcagtgg cgtgatcttg gctcactgca agctctgcct cctgggttca cgccattctc 

80161 ctgcctcagc ctcccgagta gctgggacta caggtgcctg ccaccacccc tggctaattt 

80221 tttgtatttt ttgtagagac ggggtttcac tgtgttagcc aggatggtct caatctcctg 

80281 acctcatgat cctcccgcct tagcctccca aagtgctggg attacaggcc tgagccactg 

80341 tgcccagccc aaacctcttt tttctttata aattgctcag actcaggtat tttttcatag 

80401 cagtataaaa gtggactaat acaggcacca tccaattgat tgagagccca gatagaacaa 

804 61 caaggaagag gaaaggtgaa ttatctcctt ctgaaactga aatatccttc ctcccctgcc 

80521 cttgacatcg gagctttaga gttacaccat tggcttccct gattctgagt cctttggaca 

80581 tggactgagc catgctacca gctttccctg ttctccaact tggagacagc ctatcgtgga 

80641 acttctcagc ctccataatt atgtcaacca attcccctaa tgagtcttct ctcatctatc 

80701 tatctacata tatcctattg attctgcctt tctggagaac cctaatgtgg ttacaataac 

807 61 acaaaattca ctagtttata tggaagactt agtttttgcc tttgccccat tttatatttg 

80821 tattataact gtttctggaa aatggaacaa gttttcgtct tcttcatatg agggctaaag 

80881 cttttttotc actaatattt ttggagattt ttaagatttt cttttgtctt gacatacaat 

80941 cttatgaagg ctgagaatta aaattatttt tctattttat ttttcaggct caagtgtttg 

81001 cttttgtaga ttcttgagca cgttgagagc ctccaaggct tggagggggg tgcctaaagt 

81061 ttcagtgatt atagggagtt gagagactca actgggaaag gaaacgtcta aacagaggca 

81121 atttgagaga taaaagtttt ctcaaaggag ccattaaagt tctaaataat tcttagtaaa 

81181 gtcatgcaaa caggaaagga agtagacagg attagttcct tattggtgga acacatagtc 

81241 agcggaggtt tgggaaggga gattttagtc aactgagaag ttcccatgaa aggagcaaga 

81301 tcaagatctg atggaagggg aagagacacc atgaaacaaa atccaggaat aagttccaac 

81361 ccaagaggag aacagagagg cctcaaaacc aaagctagga taagaaactt ctagcctgag 

81421 agttaccttc tagacaaaga agactgagat tccaacccag cttcagagag tactcacatt 

81481 ttgatgttac tcaaacttta ggctttttaa tgacttagcc atgcatgcaa aaggcattcc 

81541 ctaaggtggc acagaagacg gagcccctat atccaaagat agccaaggag aaagaaagac 

81601 ccctgttgcc agagccagtg gataaaggca acagaaaaag agacaagggt ccttatgtga 

81661 tgagaccttt tcagatttag gcttatataa actcctgaga actgggagga tgagagccac 

81721 agatggggta ccaacatttc tactcatttg attacaagtt ctcgggcatc caaaatgatt 

81781 aacaaaatga caatttctag ggcttctgtg ggatagtatg gaaggtcttt ttgaacgttt 

81841 taatgctgtc aactgaagag tgatgaagtt catatatttg gaaaggaaag atttctttat 

81901 ttttatgttt ctttattctt tttttttgaq acqqagtttt actcttgttg cccaggctgg 

81961 agtgcaatgg catgatcttg gctcactgca acctccacct cccaggttca agtgactctc 

82021 ctgcctcagc ctcccaatta gctqaqatta taqatqcata ccaccatgcc cagctaattt 

82081 tttgtacttc tagtagagac agggtttcac catgttggcc aggctggtct ccaactcctg 

82141 ctgttagatg atccacccac ctcagcatcc caaagtgctg ggagtacagg catgagccac 

82201 tgcgtccaat gagagattta' tttcctataa agggttacag cctgcagggt agttcttctg 

82261 acaggctagg aagtatagcc tccagccaga agccagaaac agacattttc aaatgtgagt 

82321 taaaggaaac agtaatttat gctgagtggc atggccaaat acacatattt aataagctct 

82381 aggaggagtc atgaatattt atgaaaggag aaatgcgtgc atgcgcaatt gagtgtcttg 

82441 ctccttcatg ggtcccatgt acaaaaattg gcagtgttag catgatccca ggatggagtt 

82501 ttcagccccc taacactaaa aggtgaagca gaggacatga aaactcactc tgtgcatcct 

82561 ctgtacgctg gccagaacct ctccgtcatg ggtggtctct tatcaggcaa gaaaggagag 

82621 gcggcttcag gcagttggtt gatatcagtg gtggagtctt tttcaagggc tggtttctgt 

82681 taaatcctta gggaagaaag cctcatcatg gttagcaaag gagagggtat aacaaggtgt 

82741 atctgactcc catcatccca tgctggccaa gctgagaact cagttttgaa agttactctt 

82801 gggtcccctc agccaagagt gggtctgttc agtcagttgg gagcttagga tttcattttc 

82861 atttatcatt gctaatggga aagggtacgc tgtctccatg gcagctgaat tcgcaagaaa 

82921 ctccttggat ggggttaatg gcagYtgtat ttttctgKga SctYKgcttt aattggataa 

82981 agtaagttct ggtaagattt cttccNttat cttcagtatc tcaaRtgttt tcaYttaaat 

83041 aatctttata acaacttttg atgtctgagt ggaKtcccac acagtcatct attgtaagac 

83101 tttctgattc cttttttttc ctttggtcat tMtgaatagg gcttctgtaa ataaYtgcat 

83161 ggtagctttt gWtKggaaat aacatcaaag tagttgtcaa aatacYtagg aatgtKattt 

83221 ttggattgta aggKgagact tgtttagctt tRgaaaaaaM tgMccaactt gtaatgggga 

83281 ggaaaaaaat tttctNtgtt tttggaattc ttagatggRa cScKctgtaa aaactgacag 

83341 attaaaatga gaaaaaSaga aaagtttaaa aacaYgtata YcttatggWt acatgggaga 

83401 tactcaggga aaaatgagta aatctccaac aggtggcttt caRttcaagc ataaatacta 

834 61 tcttcaactt aaagaaagaa gatttgaggt gcagtggtNa gtggggagtt aaccagcaaa 

83521 agcacattag acaagggtaa ggttYgttat acagacttaa gtccaYgcat tctccattga 

83581 taagactctt Yagtgattta gttatccttc tcttcttgKt gtcgagagag gtagctttta 

83641 aatggtgatt tcctttatag atgtaaattt tccttacaca agtgtaactt ctactctRtt 
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83701 ttcacaactt cgttagcatt tttttttttt tcaaaataat cagcttggaa taattcttaa 

83761 gccaaaggga catattttgK ggtKgcatat tctggtttcc taccattata ttttggggtg 

83821 gcataKtttg gYcttataca ctgtgttcca cYggcaatga aaagagttct tgtttttcct 

83881 ccagcaattt gtcattttgt agtttagcag ttctaagagc tatacaccag ctgtgctatc 

83941 tcactgtggt tttcggttct ctagtatgtt gagcatcttt ttgtatgttt acttgccatc 

84001 tgtagatctt ctttggtgag gttctgttca gatctgtgtg cattttcaat tgggttgttt 

84061 aacttattgt ttagttttaa gattttttta tatattttga atacaaattc tttctcagat 

84121 ctgtattttg caaatatttt cttcaatatg tggcttgtct ttttgttctc ttaacaaggt 

84181 ctcttccaga gtataaactt taaatattaa gaaatccaca ttgtcatttc ttctgtgtat 

84241 atcaaccttt tgtgtcattt gttaaaattc attaccaaac ccaaaggcac atagcttttc 

84301 ttctatagtt tcttctagaa atggtacagt tttgcatttt tagtgtaagg atgattttga 

84361 gtgattattt gtgtaagttg taaagttttc atctacacac atatcatttc ttatggtttc 

84421 caattaactg ttccctattt ctgggaaaga cacaggatag tgggctctgt tagagtagat 

84481 agatagctag acatgaacag gagggggaag ctcctggaaa agggaaagtc tgggaagcct 

84541 cacctggagg taccaccaaa aattcacata ttagtagcat ctctagtgct ggagtggatg 

84 601 ggcacttgtc aattgtgggt aggtgggaga agaggtacct atgcagaaag aaacacccta 

84 661 gaattcctct taagatgccc caatcatcat tcattctgca ataaaaatgt catacatcta 

84721 ctctactgca cccagccctc ttctgcaatt tcaataatca attgtgctat ttgcctttct 

84781 ttcagcaatg agattttatt tttctttcct aattatttca aacatgaact ttggttccag 

84841 agaactagta tttccttgat ttataaattg agggcagctg ggcatggtgg ctcacgcctg 

84 901 taatcccagc actttgggag gccaaggcag gcagatcact gaaggtcagg ggatcaagac 

84961 cagcctggca aacatggtga aaccccatct ccactaaaac tgcaaaaaat agccagccat 

85021 ggtggcaggt gcctgtagtc ccagctactc aggagactga gacaggagaa tcgcttgaac 

85081 ccgagaggtg gagactgtgg tgagccaagg tcgtgccact gcactctagc ctgggtaaca 

85141 tagggagact ctatcctcaa aaaaaagata aaaaaattga gggtcatctc acagacgatc 

85201 taataatgaa ttattttttt gtctttagaa aatcaacatt aacttttcta cttttagata 

85261 tcgtaactgc tgtgacttga aggacttatc tagaaaaagc cttaaaaaac tacggtcagc 

85321 actgggtgaa tgggttgggg gaacccacat aaaatcccca agacacctgg gagtccatgt 

85381 ccccatgagt gggactgcag gcagctgtag cagactggat gggagagggc agcaggcagg 

85441 agaactcggg gtctggagtc cacggttcta aggccaggga aaaccactgg caaagtgaaa 

85501 tgcggagctt gacaggatga aatttgtgat tgtaaatgaa tatttgccat ttccaagtga 

85561 gatcgccagt ggtggtggga tggatgggtg cccctccaag tgggctgcag tgaggagagc 

85621 gtggcaccag gccaggatgc tcctgccagg aacacaggat ctgcacacgt ttaggaggaa 

85681 acgctgggca gacccagctt ggagtcatct ctgctcttta catctgttaa ggctgtgaaa 

85741 actgagagtc ggccggatgc agtggctcac gcctgtaatc ccagcactct gggatgccga 

85801 ggcgaatgga tcacctgagg tcaggagttc aagactagcc tggccaacat ggtgaaaccc 

85861 catctctact aaaaatacag aaaattagcc gggtgtggtg gtaggtgcct gtaattacag 

85921 ctaatcggga agctgacaca gaagaatatc ttgaacatgg gaggcagagg ttgcaatgag 

85981 cagagatggc gccattgcac tccagcctgg gtcacagagg gagactccgt caaaaaaaaa 

86041 caaaaacaaa aacagaaaac tgagtctcag gaacagttcc cgagaaggaa aattgggccc 

86101 gcatggaaat agacattttt ctcccaccta gggcagggag tgaagtgaaa taggtctgtg 

86161 gagtggactt tcacatagaa accatgtatt tcctaaattg ggggttattt ggggatcacc 

86221 tggaggagta ttcctggttt tggtgaaaca cacgggggta ttttttgtga agctgcaaat 

86281 ctggcacagc aataacRgct ggggaactgg agatMaagga gaaggcatac taagtgctgt 

86341 tgcaagtttc ccagaagtat gacattattg ggaagtaaac tactttttaa aacaaccgtg 

86401 gcaataccac gtcagtaagS cagagacaac accatgaagt ttcatgacag aggccaaatS 

86461 cagcccaaac ccagcccagc agaggctgtc aactcagcgc cccagcgaga gccggaaggt 

86521 tccatcctca gagctgcaga ccctctcgtg tgggctgcaa aggccatgtc tgcatcccgg 

86581 gcggtatgta cgctctgaga gatacatgcg tgttccgggg gttatatgag tgtgacgggt 

86641 gtggcgtgag tctgactgtg tcacgggcgt tccaggggtt acgtgtgtgc tctgagggac 

86701 acatgcgtgt tccRgggRtt atatgagtgt gacggStgta gcgttaggtg acRatgtcat 

86761 ctccgMgttc caagcgttat gtgcgcactg agggacacat ccacgttccc ggggttggat 

86821 gtggaaggca gctaccccga cgggtgtgct ctctgcatac gacgggtgct aacactagca 

86881 tcacagatgc agtgttatta gtactacgga ggttattatc agtgtggcgg gtgttctaat 

86941 tgctttcctg acgctacatt tctgttccaa gaccgcagct tggccctgtg gctgcctcgc 

87001 cttgggtgtg gagaatgaac ctcgagtgcg ctggattcac aggggatttt ggtttctaat 

87061 tttccacatg aagggccaag gcgggtggat cacttgagat caggagttca agaccagctt 

87121 gggcaaatag caagaccctt gtgtctacaa aaaaaaaaaa aaaaaaaaat tagatggcat 

87181 gcttgcacgt acctgtagtc ccagctaccc aggaggctga agtgggagga ttgcttgagt 

87241 ccagaagttt gaggctgcaa tgagccagcc atgatcgcac cactgcactc cagcctgggc 

87301 gacagagtga gaccctgtct ctctctctct ctctctctct ctctctctct gtcacacaca 

87361 cacacacaca cgttaaattt gttggattat atatttaggg ggttgagcac ttttggttat 

87 421 aaaatattta tgattgtggg aacaagttaa taaagacacg aaacttattt aaatgtccca 

87 481 gaactttaag aacaaaaagc attcttagtt taaaaataag ttttacttta aaggtaatag 
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87541 


tacacacata 


aattgttgtt 


aaaatcgaca 


gtaacaaaga 


gaagtaacaa 


tactaatggc 


87601 


ctgtcacaaa 


ctgattctta 


ataacctata 




ttaagcccgg 


gcgctgggtg 


87661 




gttatcccag 


cactttggga 


agccgaggca 


ggcagatcac 


ttgaggccag 


87721 


gagttccaga 


ccagcctgac 


caacatggtg 


aagcccagtc 


tctaccaaaa 


atacaaaaaa 


87781 


attagccggg 


tatattggca 


tgcacctgta 


atcccagcta 


cttgggaggc 


tgaggcagga 


87841 


gaatccttga 


acccaggaag 


cagaggtcgc 


agtgagccga 


gatcatgcca 


ttgtgacagg 


87901 


agagaaactc 


tgtctcaaaa 


aaaaaattat 


atgtttacaa 


caggtgcatt 


cctcctcttg 


87961 


ctttctgagg 


acgccctgct 


atgtagctga 


gtagtcgcta 


ataaactatc 


ttaacttcac 


88021 


tatactctgt 


gactligccaa 


aaggtctttc 


ccatgtgaaa 


tccaaaaacc 


tattcttggg 


88081 


gtctaggaca 


agacccattt 


tataatgaca 


aaactataca 


aattctagag 


gaaaacatgg 


88141 


aaagaaagct: 


atgtgacctt 


gSgtttggcc 


atgagtttta 


acacgacact 




88201 


tttgaacccc 


tcctctaYat 


gaactccagg 


gtggtttatg 


ttccaYtggc 


taagRgaaag 


88261 


ttttcttcaa 


aaatgtgaca 


tgatttgagg 


tcaaacatta 


atatcaagKW 




88321 


attgagaagc 




tctgggaaaa 


ccagaagtgt 


gccttttttt 


ggaaataatc 


88381 




caaacttaaq 










88441 


tttcaccaat 


taagatataa 


atgaagctaa 


Ygaaatccgg 


aaatacaatt 


tcactgtttt 


88501 


taatgttcat 


taaaaaaaaa 






gccccagtaa 




88561 


aagtctttac 




caaaatccac 


ctatqtcctq 




cactttacRa 


88621 


gcctcattat 


atgtactgga 


gacaaaattc 


aqaaataaat 


aaatatatat 


atacatatat 


88681 


acaaatatzali 


ttcaaattaa 


aaaatacttt 


taqaqagtqq 


tatgtattac 


atttaqaaat 


88741 






ggatgtcatc 


cacgcctgtc 




cqaatttata 


88801 


aatcatctca 


ggtgcggagc 


aggacaggtt 


gaaaatagga 


atgacatgaa 


cccgcgcgga 


88861 


acagctgccg 


gcgcggtigtc 


ca gggcggca 


ccccgcccgg 


tccYggcccc 


tccagccctg 


88921 


ggcccgaccc 












88981 


ccgtgtgggg 


ccgaagaggc 


tgctacccag 


aqqcqqaqta 


caaactcaca 


aaaatcccca 


89041 


cccSactctc 


gctcccgcca 




actcgcgtcc 


ccaccacaca 


ccaactcaaa 


89101 


agcagcaccg 


cccccggcac 


aggagcctca 


cgcgcctctt 


acctaacagg 


aagttgggtg 


89161 


gaagcagcgc 


ggacccacgg 


cacaccgaac 


gcactccaac 


agaacccgac 


gcagacacgc 


89221 


gctttcaacc 


ggcggagaca 


ctggcaggtc 


aatagagata 






89281 


a'tgacaaa't't 




tggataactt 




aatattgtqa 


atttttgtga 


89341 




aaaaggcaat 


gtcactccaa 


taatcaccag 


aqtaatcaat 




89401 


ctgtcccttt 




ctctggtatc 




tttttaacta 


atgatqcttc 


89461 


ttaaagaaaa 


gggaaaagac 




ctttcaqtct 


tcaatqattc 


actgcttcat 


89521 


ctcgctccac 




tqaaatctac 


atcLcttata 


cattaacaat 


gcatgacaat 


89581 


ttacaaatag 


ctaaattttt 


ggagctaact 


ttaagtacct 


gaatggaatt 


taatcaaccc 


89641 




ttctcacLtc 


tcagttattt 


atcaagttta 






89701 


ttatccaaac 


attgtttaaa 


acaatcatca 




acacttatcc 


aggggggtut 


89761 






ggattattct 


aatgtcagag 


taqaataaaa 


aataagtgca 


89821 


gcgatgctga 


ctcttccaag 


cttaacattt 


ctcacaagtc 


aattagcttt 




89881 


gagggcgtga 


agggctgctt 


gcggtagttg 


tgtagcagca 


gcacaatggc 


cqcaqacaacr 


89941 


gaaaacagtt 


tctaaaaalili 


cctcgtatat 


aattttatat 


ttttgacaag 


attaatgacc 


90001 




tcctctccat 




ggaattctgt 


tggtatgtag 


ttactatatt 


90061 


ttattaaagg 




tatctctttt 








90121 


ttcctttata 


tttttattaa 




tattattaaa 


qgaaattaqc 


cttatctctt 


90181 


attatatttt 


ttatgacctt 


caaagtagtg 


tctctgctta 


aaagtgtacc 


ctggccgggc 


90241 


gtggtggctc 


acacctgtaa 


ttccagcact 


ttgggaggcc 


gaggcgggtg 


gatcacgagg 


90301 


tcaggagatc 






ggtgaaaccc 


cgtctgtact 




90361 




gggcatagtg 


gcgggcgcct 


gtagtcccag 


ctactcagga 


ggctcaggca 


90421 


ggagaatggc 


gtgaacccgg 


gagacggagc 


ttgcggtgag 


ctgagatcgc 


accgctgcac 


90481 


tccagcctgg 


gcgacagagc 


aagactccgt 


ctcaaaaaaa 




NgtRtaccct 


90541 


gaagcacaca 


tcaagcgaca 


tgtagagttc 


ataaattctg 


gccaaatggt 




90601 




caStaaggct 


ctttacttgc 


actgacaaat 


atgaacgctg 


gggaatttgg 


90661 




taatatataa 


tattatatat 


ataatagata 


tataatatat 


aatatatata 


90721 


atacatatat 


aatattatat 


atgtaataga 


tatacaatat 


ataatatata 


atagatatat 


90781 


aatattatat 


ataatagata 


tataatatta 


tatataatag 


atatataata 


tataactttc 


90841 


catgtgattt 
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ELP3 genomic sequence (SEP ID NO: 2'> 
>8: 27927301-28022150 



1 


gagaaaataa 


ttcaaagttg 


tatttaaatg 


ttcagagggg 


gaataatacc 


aggatccttt 


61 


ctcaaaaaag 


aaacacatgt 


ctggaatagg 


cagtgatgtg 


ctgcagccag 


ctcgtaccag 


121 


ctcctgaaag 


ccaacagcac 


acatcacttt 


cctactctgc 


attcagtgac 


gtcacactgg 


181 


tagcttgaaa 


ttgacaatgt 


tgggactgtt 






gataacccaa 


24i 


atccataacc 


acattgtgat 


ccctaatctt 


caaccaaata 


attccagctg 


acatgtgacc 


301 


acagcttatc 


aatactaaga 


ggcacatcat 


ttccctgttt 


aagatttctg 


aaattgagat 


361 


gcatcttgca 


actgctatag 


accaggtgac 


aatcacaagg 


tggttgtcac 


tgcctgcgtg 


421 


ttatgggctg 


aaatgtatcc 


ccctccaaaa 


ttcaaatgtt 


gaggtcccaa 


tcYccactaa 


481 


ctcctattta 


caccacagga 


attggcaaat 


ggtacacatc 


agccttttgg 


gttttttgtt 


541 


tgtttgtttg 


ttttggggga 


tttttttgtt 


tttaaacaaa 


gagctagttg 


ttaaacattt 


601 


acaagcgcaa 


cgcaacacaa 


acctggctaa 


ggtcctagaa 


aagtgacagt 


ttcttaagta 


661 


gttagtgagt 


aatcatacac 


attttaatgc 


agactaatgt 


tgaaacatta 


gtctgggtgc 


721 


ggtggttcac 


acctgtaatc 


ctagcacttt 


gggaggccaa 


ggcaggcgga 


tcacaaggtc 


781 


aggagttcga 


gaccagcctg 


gccaatatgg 


tgaaacccca 


tctctactaa 


aaaatacaaa 


841 


aattagccag 


gcatggtggt 


gcgcacttgt 


agtcgcagct 


actcgggagg 


ctgagacaag 


901 


agaattgttt 


gaacccggga 


ggtagaggtt 


gcaatgagcc 


aagatcatgc 


cactgcactc 


961 


cagcctgggt 


gacagagaga 


ttccatctca 


aaaaaaaaaa 


aaaaaaaaag 


tcaagcaatt 


1021 


aagaattatt 


tgagggagtc 


tttttttgtg 


cttagaaaaa 


cagtgtcgtg 


gagaagtcat 


1081 


actagaatca 


tatagaaata 


gcattacata 


ttctttcatt 


cttcttactt 


aaaaaaaaag 


1141 


agaaaaacat 


tcacctatag 


aatgtgcatg 


tgcgtgtgtg 


ggtatttgag 


ggaggtagaa 


1201 


agcagtgaag 


agttgtcaat 


ctctttctga 


tgcaaagata 


taaagagctg 


agtgagtaga 


1261 


cgccttgcag 


tccactggga 


ctggtgagaa 


gccgtgtccc 


tggcaagcta 


taagcacgtg 


1321 


tagttaaggc 


tggaatacgg 


atatctgtgc 


agactgccag 


aagcccaccc 


ccaccaaccc 


1381 


ccttatagag 


tgagctgaag 


ttgcaaaaag 


aatctatctg 


ctgagatttt 


gctttgattg 


1441 


agttttcact 


ctaggaaaaa 


gtttagggaa 


agctgaaact 


attttctact 


gggagatgaa 


1501 


attaatgtag 


aattaagttt 


aagtagagtt 


ttaaaRcagt 


tccaagaaag 


gcatgtgctt 


1561 


agctgtgtcc 


agatgtttgc 


attatttaaa 


atctggacct 


aaatcttgta 


taacttttct 


1621 


aaaaattact 


gaagattagt 


taacatttat 


gctttttatt 


gtagtaaaaa 


gaaatgtttt 


1681 


tatagtgtca 


gtgaagtgac 


tctagtataa 


agggttcata 


ttattgtaaa 


agagaagtag 


1741 


aaacagctac 


agctagaggt 


aggatccaag 


atgttagtag 


atattcatta 


aattatgcaa 


1801 


cagaaacatt 


acacactgtt 


ctaatttaaa 


cacatggctt 


gtcataataa 


aacgtaacac 


1861 


aatatattaa 


agtttgtaaa 


aagatgttgt 


atgtgagaga 


ggagaaaacg 


tggaagctca 


1921 


aaccagaagg 


gaggctaggt 


gtggtggctc 


acacctgtaa 


tcccagcact 


ttgggaagcc 


1981 


gaggcaggtg 


gattgcctaa 


gctcaggaat 


tcaggaccag 


cctagccaac 


atggtgaaac 


2041 


cccgtctcta 


ctaaaaatac 


aaaaaattag 


tcgggcatgg 


aggcacacgc 


ctgtagtccc 


2101 


aactactcgg 


gaggctgaag 


caggagaatt 


ccttgagcct 


gggaagcaga 


ggctgcagtg 


2161 


agccatgatc 


acgccattga 


actccagcca 


ggacaacaaa 


ggaagacact 


atcttaaaaa 


2221 


aattaaataa 


aaggccgggc 


acggtggctc 


atgcctgtaa 


tcccagcact 


tcgggaggcc 


2281 


gaggcgggca 


aatcaccgga 


ggtctggagt 


ttgagaccag 


cctgatcaac 


atggagaaac 


2341 


cccgtctcta 


ctaaaaatac 


aaaattagcc 


aggcatggtg 


gtgagtgcct 


gtaatcccag 


2401 


ctactcggga 


ggctgaggca 


ggagaatcgc 


ttgaaccagg 


gaggcagata 


ttgtggtaag 


2461 


ccaagatcgt 


gccattacat 


tccagcctgg 


gcaacaagaa 


caaaactctg 


ttaaaaaaaa 


2521 


agaaaaaagg 


aaaggaaaag 


aaaggaaagg 


aaaggaaagg 


aagaaagaaa 


acagaagggt 


2581 


tattcaatgg 


aagcagaatc 


agaaaggata 


tagtaaagtt 


ggagttacac 


tgaaaccagt 


2641 




atgacccagc 


tgggtgaggt 


ggctcatgcc 


tgtaatccta 


gtactttggg 


2701 


aggctgagac 


aggtgggtca 


cctgaggtca 


ggagttcgag 


accagcctgg 


ccaatatggt 


2761 




cttcactaaa 


actacaaaaa 


ttagccagtt 


gtggtggcgc 


atgcctgtaa 


2821 


tcccagctac 


ctgggagact 


gaggcaggag 




acccaggagg 


cagaggttgc 


2881 


agtgagccaa 


cattgtgcca 


ctgcattcta 


gcctgggcaa 


cagagccagg 


ctccgtctca 


2941 




aaaatcatga 


cctgattttt 




ctgaaatcag 


tgacataaat 


3001 




tttttaaatc 


caccttttct 




tgaaaattat 


gttgcagtga 


3061 


tgtcactctg 


ccactgtctt 


acttgtatga 


aataatttga 


tctgagcccc 


tagcactctt 


3121 


tttttgctgt 


tgttagagat 


gggggtcttg 




ctagctggtc 


ttgaactcct 


3181 


ggcctcatac 




cctcggcctc 


ccaaagtgct 


gagattacag 


gtgtgagcca 


3241 


ctgctcctgg 


cccaattatt 


atttttaatg 




taaaagaaac 


cccagattcc 


3301 


tggggagtag 


ttgttccata 


tcttggggca 


gggaaaaata 


aagtgaacct 


gaatatcttg 


3361 


ttgtagctag 


acagcaagaa 


tgttttttaa 


atgctaaagg 


gagtaatgct 


aaagggagtc 


3421 


agcttaaagg 


gttcccactg 


gccaaaaagt 


gacacattga 


gcatcagaat 


gaacacaata 


3481 


gttctgtgaa 


aagctatgag 


ctcattatga 


tcctaaaact 


gagtaaataa 


aataaggtat 


3541 


gttaaagtaa 


aggcattaac 


cacctcaaac 
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3601 ctgtaaattc cacttgggca agtcctaaaa gtttacctaa catataaaag aatattaaca 

3661 gtcttagtag gggcacaaaa atcctgtcag gtcaaatcag gcagtaaaac tttgtgaatt 

3721 aggttaaaat aaaaagtcgt gtgacagttt atagtaaaaa ctgagccaag gcggccaggt 

3781 gcattggctc atgcctgtaa tcccagcact ttgagaggct gagggcagat cacctgaggt 

3841 cgggagtttg agaccagcct gaccagcatg gagaaacccc gtctctacta aaagtacaaa 

3901 attagctggg cctggtggcc catgcctgta atcccagcta cttgggaggc tgaggcagga 

3961 gaattgcttg aacccaggag gcggaggttg cggtgagcca agatagcgcc attgcactcc 

4021 agcctgggca acaggagcga aacaaaacaa aacaaaacaa aacaaaaaca aaaacaaaca 

4081 aaaaaaacca aactgggtca aggcaaaatt taagggaagg tcaagtgaac cataacatac 

4141 caggtagtaa gaaaagatgt ggagctagaa agcctcgtta ctataggatg gggcccaaat 

4201 caggcatctt ggaaggatct atgtgcctgt gtctcagcct tgaatctcaa actagtaact 

4261 caaatccata actacattgt gatccccaat cttcaaccac ttaacttcag ctgatatgtg 

4321 actacagctt atcaatacta agaggcacat tgtttccccg tttaatattt ttgaaattga 

4381 ggtgcatctt gcaactgctg caggccaggt ggcagtcaca aggtggttgt cactgcctgt 

4441 gtgttatggg ctgaaatgta tccccctcca aaattcacat gttgaggtcc taatctccac 

4501 tacctaaaaa tgtgacctta tatggaaatt gggtcattga ggatgaaatg tattaagatg 

4561 aagtcataat ggagtagggt gggcccctaa tccaatgact ggtgtcttta aaaaaagggg 

4 621 aaatttggat acagagacac acacatgagg agaatggcat atgaagactg gagttgtgct 

4681 gccacaagcc aggaattacc agaagcaagc agacagaact gtaccagatc cttcctgcag 

4741 tttcagaggg agcatggcgc tgctaatacc ttgattttgg acttccagcc tccagaactg 

4801 ggagataata attttctgtc gttccaagct acccagctgt ggtgcttagg aggttcatac 

4861 actgcacatg ggtgaacttg gtcacagctg ttcacattgc ccttccttct aatgagttgt 

4921 gtgcattact taactgccat ttaatatgtc ttttaaaaga ttatgttatg attcagcatt 

4 981 gaaaggtcag cattttccag gccgttacat cgaaaggcct ggaaatggaa acgtggggca 

5041 tatacttgat gttagtggaa gtatggttgg tgtagacagt aattccagat ttttttgcaa 

5101 agcaacacct aagtgcttca tgggacctag gaaagggaga catccacatg tagatgaagg 

5161 tgggttgtgt tttgatactg aggtgtatgc aaagtgactg cctatcccac acccagccaa 

5221 aaaaaaaagt gatggaaact caaagaatgc cttataaaga acttctgtct catgttttct 

5281 gtgattaaaa ctaaatttgc tccaaggcta ggcattgtgg ctcacacctg taatcccagt 

5341 gctttgggag gcccaagcag gagaattgct tgagcccagg aattagagac cagcctgggc 

5401 aacataccct gtctctacaa aaaatttaaa aattagctgg atgtggtggt gcataccggt 

5461 ataccggtag tcctagctcc ttgggaggct ggtggtggtc agggggtgcg gagtgcgggc 

5521 aggaagatag cttgagccca ggaattcaac gttgcagtga gttatcattg tgccactaca 

5581 ctccagcctg gctaacagag tgagacccat ctgggaaaaa aaaaaagttc caaacaacRg 

5641 catgtagtac aagtaaaggt ttcacaataa ttcaaactga aataaagtgg ttggagaatg 

5701 aaggcatcac cttgccatca gaggtgtggg ttttaggcct cagaagaact tgcaccatag 

5761 cggtgacagc tatagtcacc ctgcaaactg atgtattgct gcccaggccc taactcatac 

5821 aaaccaattc ggcatgttta ctggcatagt tgtaaattgt gtattgaagt attctacgtt 

5881 tccatttcca gggataatat gatcaggcac tcatacaatg catatttatt taataccaac 

5941 aagaacaatt caacagacca agaccagctg tactaagttt tgcctcgtta aaactggcat 

6001 tcaagaattg cctgatagaa tattttggct taagtatgag gcctaatgat attgcgctag 

6061 aaacaaaagc tttgaaattt tcaattccac gttgcacaac agacattttc tgtatcaata 

6121 ttattgaagg caaaatacag aaagaaattg aatatagagc tggtcatcgg ccaaatgttt 

6181 ttactattga atccaatacc gaagattttg tttcagccaa acatatcatc tgtcacatgg 

6241 tacttactct gatatatccc ttatgttgtg ttgtttatat tttatcttaa acttggtttt 

6301 agttttatag ttgtacaaga ctattatagg ccagacgtgg tggctcacgt tggtaatccc 

6361 agcactttgg gaggccgagg tgggtagatc accagaggtc aggagttcga gaccagcctg 

6421 gctaacatgg tgaaacctca tctccactaa aaatacaaaa attagctggg catagtggca 

6481 ggcacttgta atcccagcta cttgggaggc tgaggcagga aaatcgtttg aacccagggg 

6541 tcagaggttg cagtgagccg agattgcacc actgcactcc agcctgggca acggagcgca 

6601 actctgtctc aaaataaaaa agttaaaaaa aaagactata ataataataa gttgtatgtt 

6661 tgtacctact ttagtgataa aatagtaaaa ataggagaac actaagtgca gctcaggcct 

6721 tatccctgtc cactccctct cacactctcc ctctgagcct ttccaagcta cttgcctgtc 

6781 attcgccaaa cacagcagcc tgtgtgaacg gtgttcttgc ttctgcctca aaagtgcctt 

6841 ctcgactttt tcaccttact aaagactaca gcttctttaa aattctgctc tgcagagcac 

6901 tgtggtgctt gccgagctgt agtcccagct acgtgggagg ctgaggcagg aggatcactt 

6961 cagcccaaga gtttgagacc agcctgggca acatagtgga aaccccatct ttaaaaataa 

7021 taataataaa attggccagg tgcagtggct catgcctgta atcccagcat tttgggaggc 

7081 caaggcaggc agatcacctg aggtcaggag ttcaagacca gcctagtcaa catggtaaaa 

7141 ccctgtctct actaaaaata caaaaattaa ccaggcatgg tgaccagcct ggtcaacatg 

7201 gtaaaaccct ctctctacta aaaatacaaa aattaaccag gcatggtggc atgcacctgt 

7261 aatcccagct actcaggagg ctgaggcagg ataatcactt gaacccagga gacggaggtt 

7321 gcagtgagcc gagtttgtcc cactgcactc cagcctgggt gacaaagtga gactaagtct 

7381 caaaaaataa taataataaa ataaaattct gctcagaggc tgagttttcc ttgaacacct 
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7441 


ccccagtccc 


acctctcctg 


ccaccagtgt 


gacctggcat 


ctttcctccg 


tgtcccatag 


7501 


cactcaacaa 


ttcctccact 


gtatggagag 


cccagactca 


cctgggccta 


ctcctctgtg 


7561 


gaccacatgc 


tcattcgacc 


ctctcaacaa 


tcacgggaga 


tagatatcat 


tatctccctg 


7621 


ctttaaagat 


gagaaaatgg 


tgcctcagag 


aggtaactga 


agcagtagga 


ctgagttaaa 


7681 


tccaaactgt 


ctcctatagg 


tcagtaatcc 


cagcactttg 


ggaggtcaag 


gcaggtggat 


7741 


cacttgagcc 


caggagtttg 


agaacagtct 


ggacaacatg 


gtgcaatccc 


atctctacaa 


7801 


aaaatacaaa 


aattagccag 


acatggcggt 


gcatgcctat 


agtcccagct 


acttgagagg 


7861 


ctgaggtggg 


aggatctctt 


gagcctggga 


ggcagaggtt 


gcagtgagtc 


aagatcacac 


7921 


cgctgcactt 


cagcctggcc 


aacagagtga 


gacccgcgtc 


tcaaaacaaa 


agaaactiaac 


7981 


tgtctattcc 


aaaactttgg 


cccttgcaac 


tacacaatac 


caggcataca 


tttgtctccc 


8041 


aaaatctagt 


tgctgggata 


cagaaaaaca 


catagagaaa 


cctattaact 


caccttcagt 


8101 


ttttccttga 


gaattctagt 


cctttgtagt 


tttatcattt 


catttctttc 


taaaacagcc 


8161 


tctcgctgac 


tctgaatagc 


ttgctaggtt 


gtgaaaggaa 


atagaagcat 


gcagagagta 


8221 


gcgttatcaa 


cagtgatgag 


gtggcaagat 


gcacattttt 


ttggcagatt 




8281 


ccaaacagtc 


aaatatgcat 




agaattatac 


acagttgccg 


ggcacggtgg 


8341 


ctcatgcctg 


taatcctagc 


ccgttgggag 


gccaaggtag 






8401 


atcgagacca 


tcctggctaa 


tccagtgaaa 


ccccatctct 


actaagaata 


caaaaaatta 


8461 


gctaggcgtg 


gcggcacgtg 


cctgtagtcc 


cagctactcg 


ggaggctgag 


gcaggagaat 


8521 


tgcttgaaca 


caggaagcag 


agcttgcagt 


gagtcgagat 


cgcgccactg 


cactccagcc 


8581 


tgggcgacag 


agtgagactg 


tatctcaaaa 






tgacccttgg 


8641 


actacatggg 


tttgaactgc 


gtaggtccac 


ttatacgtgg 


attctttttc 


aataattgaa 


8701 


aaagatttga 


agatttgcaa 


caatctgaaa 




acaaaccctg 


tagcctagaa 


8761 


gtatcaaaaa 


attaacaaaa 


aggtgtcatg 


aatacataaa 


atagatgtag 


atcccagttt 


8821 


atatttgata 


atgtactacc 


acgaaatgca 


cacgattcta 


tcatgaaaag 


ttaaaattta 


8881 


tcaaaacttc 


tgtaaacatc 


tatagactac 


atggcaccat 


tggcagtcaa 


gaggaatgta 


8941 


aacaaacatt 


aagatgcagt 


attaaatcat 


aactgcataa 


aattaactgt 


agtacatact 


9001 


atactactgt 


aataacttca 


gagccacctc 


ctgttgctgg 


tgtggtgagc 


tcaagtgttg 


9061 


caaatatcca 


cttaaagcgc 




gtgatactaa 


tgatctccat 


gtgaacaact 


9121 


gctccagtaa 


attgcgtatg 


qcaataaaaa 


gtgagctctt 




tgtatttttc 


9181 


agcctgttta 


gagcaatacc 


ctaaatcttg 




tcggacccat 


gtgaagtcca 


9241 


gtgatcctgg 


aggggctccc 


aagtagcaga 


gaaaagtcat 


gacattacaa 


gaaaaagttg 


9301 




tatgtgccgt 


agattgaggt 


ctgcagctgc 


tgttgcctgt 




9361 


gtacgattca 


tcttgtaaac 


agatgaagta 


aacttatgct 


^^^^^^^^^^ 


acggtgccgt 


9421 






tccttatgat 








9481 


gttgttgttt 


gagatggagt 




ctcaggctgg 


agtgcagtgg 


cgcgatctcg 


9541 






cccaggttca 


agtgactgtc 


ctgcttcagc 


ctactgagta 


9601 


gctgggatta 


caggtccatt 


ccaccaggcc 




ttgcgttttt 


agtagaaacg 


9661 




atgttggttg 


ggctggtctt 


aactcctgac 


ctcgtgatcc 


gcccacctca 


9721 


gccttccaaa 


gtgctgggat 


'tacaggca'tg 


agt caeca ag 


tcctgccatg 


attttcttaa 


9781 


tgacattttc 


ttttctctag 


cttacttatt 


ttaagaatac 


agaacatiaa't 


acatacacaa 


9841 


aatatgtatt 


aatcgactgt 


ttatgctatc 


ggtaagactg 


ccaatcaaca 


gtggctatta 


9901 


gtagttaagt 


ttttggggaa 


tcaaaaatta 


tgtgtggatt 


ttcaattgag 


cagggagttg 


9961 


gcacactaat 


ctctacattg 


ttcaagagcc 


aactgtaata 




aaagtattat 


10021 


ttaaaaagaa 


ttagttcagg 


ccgggcgcgg 


tggctcacgc 


ctgtaatccc 


agcactttgg 


10081 


gaggctgaag 


ccggtggatc 


acctgaggtc 


aggggttcaa 


aaccagcctg 


gccaacgtgg 


10141 


tgaaacccca 


tctctactaa 


aaatacaaaa 


attagccagg 


catgatggtg 


ggtacctgta 


10201 




ctctggaggc 


tgaattagga 


gaattgcttg 


aacctgggag 


gcagaggttg 


10261 


ccgtgagccg 


agatcgcgcc 


aatgcactcc 


agcctgggtg 


acagagcgag 


actctgtctc 


10321 


aaaaaaataa 


aaaataaata 


gataataatt 




aataaaaaaa 




10381 


acttacattt 


cctgcctttc 


tttccctgaa 


atgaaattag 




caattactta 


10441 


atttttacta 


tgtgtcactg 


cagtctttcg 


actttcccca 


aaagcagcac 


acactccctg 


10501 


tgtctgtgcc 


ttttcataag 


ttctttcttt 


gacccaatgc 


tctctcttct 




10561 


cagcatccac 


ttgaaaaaag 


tgccttcatt 


tctcaacggc 


cagccagcgt 


tttgtcagct 


10621 


gtgaacattt 


tctgtgacct 


cactccctgt 


cccagtcaaa 


aatgcagccc 


tctcaccctg 


10681 


ggttcccata 


gcacctggct 


tgtcccttca 


tgtataactt 


ttaaccctct 


tttagatggt 


10741 


gagctcaaag 


actgggcaat 


gtgcctctga 


atccacagcc 


cctacctatg 


ttcaataaat 


10801 


gcttatcatt 


aataaattag 


atttaattat 


gttaattttt 


tttttttttg 


agacggagtc 


10861 


tcactctgtt 


gcccaggctg 


gagtgcagtg 


gtacgatctc 


agctcactgc 




10921 


ttccaggtcc 


aagcgattcc 


cctgcctcag 


cctcctgagt 


agctgggact 


acaggtgtgc 


10981 


atcaccacgc 


ccggccaatt 


tattttgtat 


tttttagtac 


agatgggttt 


caccatgttg 


11041 


gccaggatgg 


tctcaatctc 


ctgaccttgt 


gatctgcccg 


cctcagcctc 


ccaaagtgct 


11101 


gggattacag 


gtgtgagcca 


ccacactcag 


cctaattgtg 


ttaattttag 


tgttggcatt 


11161 


ggaaagggag 


ggctggagat 


attctctgct 


ggggtgagat 


ggggagaaat 


gcaggaggaa 


11221 


aaaaatgaag 


ctgcaattcc 


aacccagacc 


atgacagcaa 


gggaattcct 


ggagacagca 
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11281 gtcagaccta agcttcagtg ggtgtggagg aaatggtgga ggggctgtaa aggagctctt 

11341 gttggaggtg agacgcatcc gtgacatcac atcccgtgag cacacacccc atggatccag 

11401 gaaagcccag tttatacccg ctgatcccct ctaaggacta atcatacccc ctctcactct 

11461 caaaattgtc ccaatataga aggttagtta tagtcttcct aactataggg acaacagagc 

11521 tctccataga aatcaacatc tattaagccc aaaaatgcat acctttcatt gtaatttgtt 

11581 ataatacaac tcttagagaa tgcatctacc ttctcccgaa gttctaaaac ttttgaggac 

11641 ctgttccaag tgagttaaaa agttcaggag tctctggctg gggatgctgg gcttgatacc 

11701 taggtgatgg gttgatctgt gcatctaacc accatggcac atgtttacct gtgtaacaaa 

11761 cctgcacatc ctgcacatgt accctggaac ttaaaataaa aataaaaatt taatgatttt 

11821 taaaaagctc aggagtacaa aaacctgaaa catcaaagct gtaccctggg ctgggtgcag 

11881 tggctcacac ctgtaatccc agcactttgg gaggccaagg cgggcagatc acctgaggtc 

11941 cggagttcaa gaccaccccg gccaacatgg cgaaacccca tctctactaa aatacaaaaa 

12001 ttagctgggt ggaccggccg gcgcctgtaa tcccagccac tcgggaggct gaggcaggag 

12061 aattgcttga acccaggaga cggaggttgc agtgagctga gatcgtgcca ttgcactcca 

12121 gcctgggcgg cacagcaaga ctctgtgcca aaaaaaaaaa agctgtaccc taccagatag 

12181 gattataggc tgtaccccct agttatgggg gcatctacac tttctacatc tgtggagcag 

12241 gtgggatcat cacatcacac agtgcaggag gtacaccatc tgcatctggt acatgtgagt 

12301 ggcaccccca gagctgtgta ggggacacca gcttggcact aagcagctac cctggaaggt 

12361 tcagattcct aagacaaccc aaccaagggt cctgatgtcc caggaagtca cccagccttt 

12421 tctttcctcc ccaacaatca cttcctgaag ccctctaacc acgttacaaa gtcaagctcc 

12481 atgctactgt agtcaggacc cacagaggaa agcccctgac tggagcccac atccttgcaa 

12541 gcccagcaca ataccttagg tattctggca agtggagttt gtccatcttg gtgaccacca 

12601 gggccacgtc cctacagaag ccccgctggc aggctttgat gctctcattc agaaggtctt 

12661 cgtgggcttg ccccccagaa actcgctcta tgtcgctgat cacccagatc actgagcact 

12721 tgtcaatggt ctgcaaaaca tgaccacggc atgtgtcagc acaggaatgg agtcacagtt 

12781 cctgaggacc ggttccctaa tgtatccctt aagcctctcc ctccagtccc acccctcacc 

12841 gctccccacc gcactctgtg cactagctgg actgagctat gagtcatttc tcatatgtgc 

12 901 cttgttctgt taattccctg gccacagcct ctgcctagcc cccatttgtt cctcaagact 

12961 gtggaggacc tctcgggcat tgggctggcc tctgcatcct gtaacacctg gacttacctc 

13021 aagcagcgcc tctgagacac agtgactgca ccaggtctgc tctctgcccc ttgatgggaa 

13081 gcgactgtct gcacctttcc tcttgggatc cacctcacct caagcccagt gcttggcaca 

13141 caggagaggc ttcacagaag tttgtggcat tagagattga cagaattccc taaatcagtc 

13201 ctccagtccc agtcccagat ccatcactgg cttgctttat ctcagttgtc ccctggggaa 

13261 aattcttgtt tttccagaac taggtttggt ggtgcaccta ccctcaagct gtcacctgac 

13321 ggcagcattt tgatgtttga ctgcccctgc acgtttagtt caatgccacc agatggcagc 

13381 atggagccag aaaccatatg acaagttgca aaggccaatg aacatgaaac aacataatgg 

13441 catgaatgtt gctttattaa atatgcaact ttcacatgac gatcatgtgt agccttgtga 

13501 catttctcta attgtattgt tactcttgtc ttacctccac ataaatgaaa atctggttca 

13561 actcagggat tgagtttatt tgtggttcct cagtgctttg catacaaagg gatgacaagt 

13621 ttgtgacttg aaaattaaat catcattttg ctttatacca gtagttttta actagtagag 

13681 gttttgtgcc ccagaaaata taaacagtct ccaatttgtg atgattcagc ttacaatttt 

13741 tcttttcgac tttacaatgg gtttactgaa caattggtaa gtccagtaca atcggggcta 

13801 atgatggtta tgattccagc ttacagtggg cttatcagga tgcaattcta tggtacatca 

13861 atgagcatct gtatttggca aaaaatgtct ggagacattt ttaattgtga caaccttggg 

13921 ggggtacttg aaggtaaagg ccgaggatgc agctaaacat tcttcaatgc ccaggacagc 

13981 ccaaccccag ctcccatccc acaaataatt atctgacagc ccaaaatgtc aagagtttga 

14041 gaaaccctga tccagggcag tgggtttcag cttgctttta gcatgggagg ctttttcaaa 

14101 tgaaatctta gttagaattc caacatataa acttgatttt taaaaagcag aactgctctg 

14161 gttgaattgg gagtgcagga cccagaacca ccatcggctc aatgccctct accctctgtc 

14221 caggaacctt ttggctccat acatcactgc ttaaaaagca tctgcttctc aagttagaga 

14281 aaccccttgg attataggga agtaatgaag cccaagcggt aaagagactc aagtagggcc 

14341 acacgacgat ggatcctgtc tctcctctat aagtgaatgg actttgattt aggcaagaaa 

14401 ataaaaacca ttacagccct tctgctgccc aaactataat caacttagaa aagctttggg 

14 4 61 ctctgggcca gatgtggtgg ctcacgcctg taatcccaac attttgggag gctaaagcag 

14S21 gcagatcact tgagctcagg agttcaccag cctgggcaat atggtaaaat ctcatctcta 

14581 caaaaaaaaa aaaaaattag ccaagtgtgg ttgtgtgcac ctgtagcccc agctactcgg 

14 641 gaggctggga cagcaggatt gctcaagccc aggaggtcga ggttgcagtc agccatgatt 

14701 atgccactgt actccagtcc aggcaacaga gcaagatgct gtagcaaaca aaacaaaaca 

14761 aaagtagaag cagcttccag ctctgtagtg tcaacagtgg gcagagaatg gtccagccag 

14821 cggaaggcca taaggatgcc aggctgggtg ctccttaagg gcagggcccc cgtcaccttc 

14881 atattcccca tggtcctggt aagccacagg tgcttagtgt ttactaacat gagaaaagga 

14 941 cagcaaccat aaagggtgca gtggttaaat ttcagaccga ccccacacaa agctcccact 

15001 gccagggttt ggtaaagtta gtgtcgcaga agatacaaaa aagcatttcg ataaaacctg 

15061 aactgccaga cttaggagta gatgaggtgg caccccacca cccccaacct gcatctcccc 
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15121 aggcccccgc tatgagaatg ggacttagag gagttcagag acatgcctca agtcacgtga 

15181 cttctaagtg gcagatcaga ttcagggaca tctgaattca taacgggttt tagggaaggc 

15241 aagactggca gcctgagaga catagggagg gcaactgtgt cgttctccct gcagaggggg 

15301 gtcttgagta acttcttgtg agaggaggtc aagaaaaccc acaagaatgc aggggagctg 

15361 ggccaggcag agcacggtgg gaaccatccc tcgcttcagc ctgacctcct cagagcacat 

15421 tttctgagcc agatctgccg tctttgataa gctgcccagt gtatctgaac tggctgcagc 

15481 cgcggctcca tccagcgaga ggccctggca gccttcctgg gctgtcgctc tctcggccgg 

15541 gcctggaact agccaagatt cgcgatttcc accccaggat ccgaggtctt gcagaaacta 

15601 aggctcacaa ggccaggcct gagccaccga ccaaggggaa ggacacagag actgccttcc 

15661 tcctgcagga ggtggataat atagagcgtg ccctggggga ggtggggtag atagagcccc 

15721 agcagggaag ttaggaaacc tgctctccaa ggctgtgtgg tcctggacaa gtgactttcc 

15781 ctctctgagc cttcatgtct tcatctgtac aatcagggca gaggctagag cttgctctag 

15841 attgtgactg gaagtcgtgg ttaccaagct cccctggcac aatcattcat ctaaacaaac 

15901 ctgtggagtt taaagaactg gattctttgg cagctgctgt gctgctcaca actcactgat 

15961 gctggtcccc ctctcccagt ccccttccag ctcctgggct cctgtgttcc tcccaccctg 

16021 tgctccagag cgcctctgtt taccgcgccc tttctcaatc acatgcccca gtccctctct 

16081 tcctgcaagt tttcccagga agccaaactc tggagcagta gtactaaaga ccccagaact 

16141 tttccaggca tgtcttccat agtctccctc ctccacggac cccatgctcc ttctcccatt 

16201 cacccaggaa gattcaccct gcccccacca cgccagcttc ctcagacctc cagctctatg 

16261 ctgatgctgc ttacagccaa cacagttgca gcctcttggt tcttggtgac tggctgcagc 

16321 ttctctgctt tgtccccaac acttggcctc ccagttccct tctgtccttg gtcactctgt 

16381 ttcctcttaa gccactctat tgctggtctt tctcatacat cacacacact tgtcccagct 

16441 ccaatagtca catcctgtga ttcctccctc cccactttct ctggctaccc acaccctgcc 

16501 agggccttga gacttagaac atagaaatta aaagaattta attccaatga actcaaaatc 

16561 cctgaagctg aaaccagagc attctttgtt gcttgagaaa gggtagagga gtttagggaa 

16621 gagagaacta aacctacgaa agacagtcac ctttttccac atctcgtccc tcttgctgtt 

16681 gaagtcgcct gtgcctggga tgtccaccag cacgacccct tctgggatca ggtcggattt 

16741 gggaagtgtc acttccacat gtttgatcaa gggccagatg cgcatctcag cggcctctcc 

16801 atcccaatct ctcctctgtg tgcggatgta ggggtccagc ttgatggaca gctcttctgc 

16861 ctgaagaagg aggacagagt cacacacacc agccatgcac ccagctctgt tcaccccggt 

16921 tggctgtgat gcaggttcat ggtaagtatg ctgagtaagg gtaagaagtg cgtccaacag 

16981 gaggcctgca atcaggtctg taggacctag aacctgagag ccaggaagga ccctggagag 

17041 gagcaagttc aagatcctta ccttacagat gaaacaaaac caatgtcagc taaaatggga 

17101 ctttagggaa gttatagtat tgctgaataa taagtttatc ttctaaactg cattcagttc 

17161 taaaattaac atactatgta attgaYattt tcattaccag aaagctagca atgactatag 

17221 aaaattaaca gtgatatggt aaccatattt tccaaactga aaattgatct gaaattggat 

17281 cttttttttt tttttttttt tttttttgag acagagtctt gctctgttgc ccagRctgga 

17341 gtgcagtggt gcgatctcgg ctcactacaa gctccgcctc ccgggttcac gccattctcc 

17401 tgcctgactc cccaggagct gggactacag gcacccgcaa ccacgcccgg ctaatttttg 

17 4 61 ttgtattttt agtagagaca gggtttcacc atgttagcca ggatggggtc gatctcctga 

17521 cgttgtgatc cgccctcctc agcctcccaa agtgctggga ttacaggcgt gagccaccgc 

17581 gcctggccga aattggatct taaaaagagt ttttaaaatt tatcttctga ttacaaaaaa 

17641 taccttgata tgtaggcatt tctagaatat tttaatgatg gtgggtgaga gagggaatag 

17701 ttcttagacc acggttccta cttaaacatg tctcgggaca atqacacatc catgccagca 

17761 actatgacag taaatcgcaa ctggggtact tcagaggaca ggggatttgg gatgtatggt 

17821 ggtcactgtt aaataaattt ctataaaata gtcctgaaag aaagataaga atgtcagttc 

17881 ttaagagagt ttaaattaaa tcgctggatt tgttgcaatc tggggaagga gtaagagatt 

17941 tcatcaattt caaccaaatg ataaaccacg acggaggaga taataaaagc tctttaagac 

18001 attgtaattt taaaatagag cagataccgt aagagagaag cccaggaaaa ggcaggggct 

18061 tgaaaagccc cttcctacaa gagggctcag cccaataaac tggttccggg caaacatcgt 

18121 tgtagaaggc cttgattcta ggatctccag tggaaaccct gaatttgtaa gcccactcat 

18181 cgagatggta attctgttca ccttgtttcc gtgcgtaaat ttatgacgct gggaaggagt 

18241 atttctaaat atagagttgg caaagacgtt gatacccttg tgtaagttgc gtgtattcag 

18301 ataaagcaac tagttttgta atcattgtgt ttcttagaca tgctcagtct atgttttaga 

18361 tattagaagt gtaatcattt taatagtaat attcgtctta aaactgtaag aaatgggcac 

18421 tgtatttgtt tgattttcca ggtatgcaag agatagtaac atgctttgaa aaggtttcat 

18481 tcatttgatt tttaagtgtg cctagtcaag catttaaaca ttttaagagg ctcaagtata 

18541 ttgaacttta actgtaatgc aaaaagccta aggaaatgtg attaattatg taagtatcat 

18601 taatatttaa ggaagatttg ctttgttaat caggagatca ttgaacattt aactttagga 

18661 catgtataaa aattgtaccc attttaccaa taatcactag atttataaat agaatagcaa 

18721 tattttaaag gtgtatttaa aggtgagaga tacaagaaaa actaggtttt taaatgtata 

18781 gctttcctaa attgaatgtt gattccacaa atattttgtt ttaattcaga aattgaattt 

18841 taacttaaaa taaacattgt ttataaaagc aagaatatca ttctactagt tgcctatcga 

18901 atgtccatcc tgtccctctt ttatgccagc agaatcctga ttttctgtct gactggggag 
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18961 gcagtggagt gaaactggtc taagcaaatc atgacaagtc tgcttttcac tttctctgga 

19021 ctccctttca gccaggaagg gctgtgtggc tcagttctga actctgaaat ctacacagaa 

19081 atttgtcaga ggtatttctg ggaaagtttt tactttctga tgaaagggga gatgtggctg 

19141 atgccaacct tttctccact tcttccttcc aggaggctag catgaagtct agcactccgg 

19201 gagccatctc atgacaatgg gggcgacgga aggcaaaggg agccagagat gctgatgttg 

19261 gcattgctga tccactaaac caaccccacc cctactgccc ccaaacctct tacatgagaa 

19321 aaggaataaa tccctgtttg tttaagccac tataagtctt cttaaaaaat ctttggcagc 

19381 ccaaactatc cctaattatt ctcaagcaac tgtaaatagt ctttttctgg gcataagagc 

19441 tcctccattc acataccaaa aatttccatt tgatagccat aaataatgag gataacttgg 

19501 gctacggata cttctaattt gggtagcaac ttatttctta tttcacacaa acactgtacc 

19561 acaagcccat cccctgaaac aggttcaact gttagacttg agacccagga tgaaatcaag 

19621 gaaaaaacga acttgacgct acctcttccg ccttgagggt gatgactctg gaggtgggga 

19681 tcttcctttt gggcttcgcc ctcagtaact cctcatagtt cttactctct gccccatttc 

19741 cataaatcat ttgtagcttc caggtggctt cctccactgc ctcatccctg ttccacgcat 

19801 ctgcctcttc tctgctcagc tcctccgtcc tatgcaggag tttggtcagg ttcttcagct 

19861 cctccctcca ctcctgccaa ggcagagtag ggccaagccc ctcttgacac agacacagag 

19921 aacaccgaca aacagagggg cccattgccc cctgtggagg gccagggggc tgcatattca 

19981 gaaaatctag gtcatctatc tggggtctcg aagaggggaa tggcttgccc aaagtctcat 

20041 aaaaatgcaa ggaccaggct agaccccctc cccagctcac ttgatgaact ttctcctggg 

20101 atgggctctt ttcattcaac aaaggcatga gctgtatttg caaggaatat aatgttatct 

20161 ctgagagaca gtgagaagga aagaaagaaa aggaagaaaa aaaggaagag aaggaagaag 

20221 gaaagaaggg agagagggag ggaagaggaa ggaaggaaga aagaaaggaa ggagggaacg 

20281 aaaaaggaag aaaggaagga aaaaggaagg aagggaagga aggagggagg aagggaacac 

20341 ttacctggtc agacagaagg tggattttgg cctcatactg cacacagcag ccagagctca 

20401 cttgtacaat gcaggaagta catatgcttt ctccagacac tggtagaaac attgcttgct 

20461 ggatgatggc attgatcagg gagctcttcc cagccccagt gcttccaaat aatgcaatgt 

20521 agattgggtc cactgtcggc ttttcaatca aggcaagaag cctatttctg gatgaatttt 

20581 aaaatgcaca ttgccatgat catcaaagtt tagagtgaag agcattgaaa cagaatgcaa 

20641 aggaactgaa ttctaatccc gggcacgtca ctacaacctg agtctctgtt ctctcctctg 

20701 tcaaaggaga atatcaatcc ctgttgacca cacagagttg gcctggggat caaatgagat 

20761 tatgtaaaaa tatttggtaa tgataattcc acacaatccc tgtaatgtct atatagatgt 

20821 gaggtgtctt tatttgtatt cactggaaat tgcatattct tttttttgtt tgtttatgtg 

20881 tctgagacag gatctcactt tgtcacccag gctggagtgc agtggtgcaa ttatggctca 

20941 ctgcagcctt aacctcctag gctcaagcga ctcccttgcc tcagcctctg aagtagctgg 

21001 aactacagqc qtgctccatg acgcctggct aatttttttg tattttgtat agagacagag 

21061 ttttgccata ttgctcaggc tgctctcaag tggtccaccc gcctcggcct cccaaagtgc 

21121 tqaqattata qqtqtcagcc actacacctg gacagaaatt gtatattctt gatcctagaa 

21181 atacaatgta tacagacatg aagtattaat agtgttagga gcaacacaga gtcttactga 

21241 gcaggagaaa ttgtagcctc agcaccactc gagtccttca agtcagcaga ggacaaataa 

21301 ttggggacat gagctcagag tcacacagcc tggggtgaat tctggaccct ctcactgact 

21361 actggtgtga gtttgttcac atctgggcct ctctgagcct caatttcagt aaaatggaag 

21421 tgatcataat agctcctact ttataggagt gttggatggg ttaaatgata tttgacatat 

21481 ataaagcact tagcagatta cctaacagtg acagctcaat gaatggcaac tatcccatag 

21541 ctacagaaat ttgtgtccct cttttcttag tttctaatta aatagtccct ttccttgttg 

21601 ggggaaagca cctataatcg cctaaggaaa gacactgaca taaaatttca gggttgcaga 

21661 tttcagactc acatgagata cttgactcca ttagggatgc tgtcatccag gaagacagac 

21721 tgaataagtt tctgataagt gttgctcaaa acccttctgg tccgtgattc caatttttca 

21781 tctggaatta acagagagat gtcacctgca ggtacctggc agggataaag tactaggtct 

21841 ccaaaggagg tgtctgttga acatcgcaga gaagggaaaa aaaaaatgaa atgaaaagcc 

21901 gattaaacaa gatagcaggt tatctcacac gagggatggc ggctcactgg agatagcaat 

21961 cattggttct gaaaatctcc aaagagccag gctgctctga aactctggta acagaagtta 

22021 tggtatgctt ggccacgggc ttcagcccct tctagatttt tccctcctgt ggtcataaaa 

22081 agaaagaaaa aatgcaagga caccatgctg acacccacgg aagcaccaac ctcatcattt 

22141 tcacaggctt tgttctaatc tggaagaagt tattttgata acaaccctcc tgcccacaga 

22201 tgcctggaga caaattgcct aactgcatac cgctacagag actttgagcc ctctctttcc 

22261 aaaacaggtt tatttgatgt ttcatccagc ttcagaattt aacctcctct tcctttgacc 

22321 attttaacat acttggaaca gaaccgaacc atgttaaaac aacagttcca agacactcac 

22381 attccttaag agcactctgc tccatggagg gaaatgctcg gaaccgctgg tctcgatctg 

22441 atttccttct ttttctcgtt cgttctttat ataaatcatc ttcaactaga gataaacaga 

22501 ggatatataa gattataggt gttggggtat ggtattcttg cctcacctct atcagcatag 

22561 aaactcaaag ggtctaacag actggctggg tgcagtggct catgtctgta atctgaagca 

22 621 tgagggatta catgcctggg aggccaaggt gggaggattg cttgaggtca ggagttcgag 

22 681 acatgcctgg gcaacacagc aagacccttg tctcaaaaaa aaattttttt ttttggtaga 

22741 gacaggggtc ttactgtgtt gcccagcagg ccagagtgct gcggcatgat cacgactcac 
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22801 tgcaacctca acctcccagg ctcatgtgat cctcccacct cagcctctag agtacctggg 

22861 actacaggca cacaccatca cacctggcaa atttttgtag agacagtgtt tctgtatgtt 

22921 gcccaggctg gtctcgaact taccaaaaat atatatatat atattttttg attagccagg 

22981 tgtggtggtg tgcacctgta gtcctagcta ctcaggaggc tgaagtggga gggtcacttg 

23041 agcccaggag ttgtaggctg cagtgggcta tgattttgcc actgcactcc aacctgggtc 

23101 atacaagacc ctgtctcaaa aaataaaaaa ataaaaaaat aggaactgct tccagctctg 

23161 tctgcccata aactcatttg ggaaaaaaca ctgttggatc attccagttt ggttcctact 

23221 tctcagagtc acaagactgt ctgcatttag ttccaaaatg tagagttcaa tctatctccc 

23281 cagataaata gtttctattt gcaactcctc agaagccaaa gtggctggac attcttcacc 

23341 aaatgccaag tcttaggaaa tcaaccaact gtgtagagaa aagctgaaga cggggttaag 

23401 tgtgcagtga ggaaaggcct ctgggaggtc tttccagact aggaaagaaa gcgactagaa 

234 61 gcagaatgca aaggaaaaca ggggccaggt gaccagaaaa aatcacatct tcttgagcct 

23521 caatttattc atctggaaaa tgagagaact gacaaaaaaa atacacatgc attacaagag 

23581 tttggatgga tgcccttcag agatccttca aaatttg'ggg aaattttttc ccttcttcac 

23641 ttggagacat cagcagttgc ttccttgcat aataagcaag aacagaaatc acctagaagc 

23701 agatctttgt agctctgcgg agattcattc acttaagcag tttcccttgg ttatgaagtg 

23761 aagggcttgt tgggcccatc atcctaaaga aggggtataa ccaaaatgtt gtgttttaga 

23821 tggattttag aaaacacccc ctggtcatgt ttacctggct agaaaggggc aagaggtgga 

23881 tctttggggt gaaggttgtt gaaaatggtt ctaacccagg gctcctgggg ctggcctgca 

23941 ccttagaaaa tggccatgag gtgaaccaaa agagtagatc cccaagaatc acagggaggc 

24001 catgaggctc caaaactgag tgtgacattc tgaggtcatt ccctgcagaa tcacagagaa 

24061 gaaatgtcac gtggcatcca agaacggcag cccacacaga aggcccagtc tctgcctgag 

24121 ggctccagac acagatctcc cagacaattc aggcagccct ccctctctgt tcaaccttgc 

24181 atcttccagt agaaatggaa gccaacgcac accccgtaga atattctagt gcaagccatg 

24241 ctatttgcca aggcctgaat ctctgccacc tcatgctcca cgctcaagcc acaactgaga 

24301 gttgagagag gttatacttc gacccttccc tctcagacat tctaactgat gattgagctc 

24361 aaattcaaga attcgccctg gctcggtttg aggactgcct tagaactaaa taccaagtgg 

24421 ttgctggctt ggtggctgga gctggagagg aagaggcaaa gaaagaagtc tgctggtcag 

24481 gcccctaggc cagctaccgg ctgaagaaca ctttaatctg tgctcgaaaa agcatctcag 

24541 aactccagga tgtgtccaca acctgtctgt ggaaagagtg gaaagggatg tagactttgt 

24601 aactcaggaa ctcttttggg taaacaatgt ttgcctaatc tcttagccat gtgtataaaa 

24661 caagggccaa gtccacacat ccagggccct tatttcaaca ttgtaaggac tggcaacgaa 

24721 tctgataaga gtttggaagc ttatttctca actaaagcaa atgctaggaa gtccaataca 

24781 gcagggttta aaactttttc ttcaatttct aaaatgcaaa actatctgga aaggagaaag 

24841 tattccatct tggatttata ccatacgagg gtagtatttt tatattaqat tgtaaccaaa 

24901 ataaacttgt ccgaatttag atcacagatc ttattattct tgtccatcgc tcaagccctt 

24961 ccttgccctt tccagcattc tccaccgaga aaccagctgt tggggtggat aqcqqcagtc 

25021 caaaaagcSt agtatactgc tgctgttcct tcttgctgcc acaaggtgtc agttttgaaa 

25081 actgatcccc caacagcctg agctgagatg Yggtttcatt gttgactgtt gaaatttttt 

25141 tttttttttt gagacgaggt ctccactctc gcccaggctg gagtgcaatg gctgggtcac 

25201 ggctcactgc agcctcaacc tccgtggctc aagcaatcct ccgacctcag ctcccaagta 

25261 gctggaacta caggcactca ccaccacgcc tggctaattt ttttttcttt tttttttttt 

25321 ttccagtgat gaggtcttgc tgtgttgctt aggctggtct tgaactcctg ggctcaagag 

25381 atcctcctgc cttggcctcc caaagtcctg ggattacaga catgattcag accattgcca 

25441 gaaactgcca gaaatcttcc aagcaaaacg atcccattcc tctaggctaa gctcaactga 

25501 ggtgacccca aagtggcagc tcttcctata tgcctgctca ggcttgcttc agatgctagc 

25561 atcaccccca ctaacccaga aacactgctg cactttgccc ctaaggaaat ccatcccctc 

25621 ggagtgagga aacattccct cttcagttct caaaacacaa aggatatgat acgccactgt 

25681 tttctgcaac tgggggttat atccctgtca tcgtcctata gggaagcctc ccttattggc 

25741 tccaggggcg caataccagc cttctgaggg agggaagcca cccgcactga ctccaatgtg 

25801 agtcaccctt ccaggagccg ctccaaggca cctacatctc tccttctacg gaaagccgcc 

25861 taacgtcatt ccaggggtct tgtaagacag gctacctgac ttggacagag agtccaagct 

25921 gactttctgg aacattatta ttattgagat ggagtctcac tctgtcgccc aggctggaga 

25981 gcagtggcac aatcttggct cactgtgacc tccgcctcct gggttcaagt gattctcctg 

2 6041 cctcagcctc ccaagtatct gggactacag gcacacacca ccacacccgg ctaatttttg 

2 6101 tatttttagt agagatggga tttcaccatg ttggtcaggc tggtcacgaa ctcctgacgt 

2 6161 catgtgatcc acccgcctca gcctcccaaa ttgctgggat tacaggtgtg agccaccatg 

26221 cccagcctgg aacttttttt ttttttttta aggaaaaagc cgcacttatt taactgctgg 

26281 ccaaacagtt cacttcatta tgaacaggtt atttcagtta ctgttgaaat atctgatctt 

26341 cattcactat ttaaacttgt cacgagaaag caccttctga ataaagagta aggaacaatg 

26401 acaacagtga aatatagtcc caacctattt gccttttatc ctatatcagt aatcagttcc 

264 61 tatatcctcc atcagaagat actgcaaaga tgttacctgg atgcggttcc tggccaaaaa 

26521 catccttcgt ttctgccatt ccttgttacg tgaactcctg ctcttctcag ttcaggagaa 

26581 ccagcctgtg aagacaaagt acaaagacag gtggggcagc ggaatttgaa aatacagaaa 
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26641 gcaaacccta aggatgtgtc agtctgcttg tgcggtattt ctgcccattc caacgtatct 

26701 ctgaatccct cacaacacct agtacactac ttgatggaaa tataacaaag attagtgttt 

26761 gaacactggt ttccaataat acttaggcca ggcgcagtgg ctcatgcctg taatcccagc 

2 6821 tctttgggag gccaaggcgg gaggatcact tgaggtcagg agttcaagac cagcctgacc 

2 6881 aacatggtga aatgccgtct ctaccaaaaa tacaaaaatt agccaggcgt ggtggtgggt 

2 6941 gcccataatc ccagctactg gggaggctga ggcaggagaa tctcttgaac ctgggaggcg 

27001 gggcttgtag tgagccgaga tggaaccact gcactccagc ctggatgaca aagcgagact 

27061 ctgtcttgag aaaaaaaaag agtctaaacc agtattacct gagagcttta gggtaactca 

27121 cgctgaaatg agaataagcc tctggtgacc agagctgtga aggtgaattt atatagcttc 

27181 gctgctttgg gtggcatcag tgctccactg gggcaaggct tgaaccctga gccagtgcta 

27241 tttagttcca gaaagtctca gcaggaatct gggaggtgga acattagata ttcctagagt 

27301 tccctggagt tgctttgggg ggctgctgac tggggaagga gatgcagtga cttggttccc 

27361 tggaccttcc tcaccaagcg caacttccac caggaggaca cccctcctgg gctctccttg 

27421 cagcttgcag catggagcat ctgcaggagg gagaggagag gcacatccaa aagaagaagg 

27481 ggcagatcca gttggtctcc agtccaccag cagatagggc ttggacatag agggagaact 

27541 ggacccgtgg aggctcagaa cagagggcac tgcccagaca gggtagaatc tgctagcctg 

27601 gagctgcttc atctccaccc atagagaaga atttgaggaa cacgagtggg gaacacccaa 

27661 ggtgcttact tgccgttacc tgatgtttct gtttggatca ctcagccgtg tatttcaaaa 

27721 gcagagtcac cagcccttct gaaatatgca aaatcacctt tgatctgtcc tactagaccc 

27781 acaggctgtt atttagggaa actatgtctc atttacacat taaccccaaa ttaaaatcac 

27841 cattcatcag ttactcttag tttggtttgc agatattaac tgactaaacg gcttctgtat 

27901 ttggtgggtg ttgggtgaat ggggtgagaa cccctattaa gtgcaagatt ctgagtctta 

27 961 ggtagagctc tttccctctc ccttcctccc ctctgcagcc cattttcaga tcatcccaca 

28021 ctccttacag ttctccaaag cagggtatcc tattgaactg ctgagcattt tagcacatgt 

28081 ggggtgccct tcccttcttc tcgcaggtga attcctgctc ttgcttcaaa atcctgttct 

28141 agtgtttcct ctgtgaaacg gtgtctgatt cccactcatt ccccccgtgt gtccccggct 

28201 ccatacacac ctgtgttata acactgactg tgctgtgctt acgtctgttc ctcctggttg 

28261 aaccactcga ggtcagagga ctgcattttt ttcagtggtg catttctatc aagaagggca 

28321 atgactaact aaacgttagc tgaatgaatg gataacataa tggatgggaa gcttttgatg 

28381 agattccaca ccaaaggatg tttaaaagct aaatttgttg ttttggtctg gtggtttgtt 

28441 tgcttgtgac agagtcttgc tctgttgccc aggctggaat gcagcggcac aatctcagtt 

28501 tactacaacc tccacctcct gggttcaagc gattttcctg cctcagcctc cagagtagct 

28561 gggattacag gcattgcacc atcatgcctg gctaattaca attttagtag agatggggtt 

28621 tcaccatgtt ggccaggctg gtcttgaact cctggactca agtgatccac ccacttcagc 

28681 ctcccaaagg gctggggtta caggtgtgag ccaccatgcc tggccctgaa ttcatttttg 

28741 gtatgttaga gtccttgatt gaagagtaaa gcaaagagta gaaataaata gtcaagtatg 

28801 caaatgaagc Rgcactcaca gtgaggactg ctccacgggg atgaggaata tttgatgtga 

28861 ggWagggaag acacaggtaa aggcagaaga tgggggcagg ggccttcatg gtggtcttca 

28921 aatgcattac cttacagctg caaataaaag agacatcctg catattttct agggttccca 

28981 gagggtagaa atagatccac taggtggcag gtacaagaaa. gcattttggt ttaacattct 

29041 caactaatca gagttgtcca aaatggaatg agctccctta ggaaaccgtg agttttctgc 

29101 ctctagaggt gtctgaacct atattgatat gaccacctaa taacgacatt gagatagata 

29161 aattttaagc cttcttcaaa tcccaagatt ctatgttctt acttgaaatt ttcaccaaga 

29221 aagctggtag tagaagtgcc ctatgaggtc accaaatttc acaagaggca caagcaattt 

29281 ctggtttgga atactgagtc agaaatgctg acacacatca aaacctaggt gagctacaag 

29341 aaaaaaaaag gtagatgggc tataacatga gtaaatatac cataatgcat gtcaataatt 

29401 ttaggcaggg tgtgatggct catatctata atcccagcac tttgggaggc tgaggaggga 

29461 ggattcattg agccctggag tttgagacca gcctgggcaa tcaagtgaga ccttgacact 

29521 accaaaaaaa aaaaaaaaga aagaaagtag ccggttgtgg tgacacacac ctgtagtacc 

29581 cactatttgg gaggctaaga tgggaggatc ccttgaaccc tggagttcaa ggttgcagtg 

29641 agctatgatt gcaccactgc actccagcct gggtgacagc aagaccctgt ctcaaagaaa 

29701 aaaaaaaatt agatacagaa ctaggattac ccctaggaaa aagatctgtt gttttctgaa 

29761 gagagcagct taaggagtaa ctcgtatgaa atgctaggca tcatattatc acagtgagca 

29821 ctggaagcaa aacaaaatgt atgatacaac cagtgaacaa atccatggtc gacctccagg 

29881 acagctggtg tgctggaatt caaattctag gaggaaacag tggagccgga taaagtacaa 

29941 aagaactgat agaccaagtt gataagagtc gtggtcatac agacactaag attagggacc 

30001 ttcaattgag agagctgtgg attaaagggc acaaaatcaa agggccttga gtttagagca 

30061 gaagcagctg tggtcactga attctggaat acgggtagta gttttggctt ctttcagaat 

30121 cacaagaact ttagttcatt tgacttcatt tactaaatca acagtacaca gtgtccagca 

30181 agggccgtta gcatgtaact agctggggtt ggtttgggtt tagctgcatg caatgaagac 

30241 agaaaaccac acaacttcct ccgtagcagc atctatggtt tctgtataca agtaatttgt 

30301 gaactctaac tgaaatttgg aatcattgga aattatgtgg tttggttgtg gtggattggg 

30361 cttcaaattc ctttggccaa tgttcaaaga ctggcaggaa gggatcattt gatgaccaaa 

30421 caaaaacact agcaaatgga aagggatcat ttgcaaaaag gctaagagga cagaagaaaa 
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30481 taaacaccaa gtacataaaa aggaaaagcg ataaacaaat gggggtagtt tggtacagac 

30541 attcagtata tagtcattgg gttgaactaa atgatgaggt gggaagaaca ttaggctgga 

30601 aatctggtgt ggaagcttcc agctccagct ttaccaccca tcaataaagc tggaggactt 

30661 aagacaagtc gttttatcac cctggcctca ggtttcccat gtgtaaggca aagactgaac 

30721 tagtccttaa agtgtgagtg ttctctgatt tagatgggtg ctaaaaagac acaaattctc 

30781 attttgaaaa tgtcacctgt cattgaaaMt gagaccgtac caatgattag ggcttccgtg 

30841 tggccaagag cagtacacca agagcactgt gatttcaagt gcattgtatt aaggagaaaa 

30901 tgagctacag gagtatagtc cttcttcaaa tcatcctgca gtctatttcc ccccaaaggg 

30961 aaggaatggt ggtgttcatt gtgaaaattt ggtttctgtg gagaagcaat tgataacagt 

31021 ccaaatttat tgctcagata agcaaggttc tgcctgtgtg gtttattttg cttgtatttc 

31081 agctatgtgg ttagcactga agttttgctt ttagtgctta tttacattca tgtgcactga 

31141 acctaagaat agcatctttt aaataagact gtgcacattc agctagatat aggaaggctg 

31201 cacagaactg atttcctgat aaataattgt aacttttcac agttattctt gcttctctcc 

31261 ccctcctttt cttctgtttg tgtttgctta ttgttttttt tcttacagga aaatcacaga 

31321 ggcaacacaa acgcaggtca ttcctgtact agaacagctg catatcatgg cattgaataa 

31381 agaatgcggg ccgggtgcgg tggctctcgc ctgtaatccc agcactttgg gaggctgaag 

31441 caggtggatc acaaggtcag gagtttgaga ccagcctgac cagcatggtg aaaccccatc 

31501 tctactaaaa atacaaaaat tagcccggtg tggtggcacg cgcctgtaat cccagctact 

31561 caggaggctg atgcaggaga attgcttgaa cccaggaagt ggaggttgca gtgagccgag 

31621 atcacgccac tgccagcctg ggcaacaaag cgagatttcg tcttaaaaaa aagaatgcta 

31681 gggccccttc ttctacagat ccagaccctg aggaccagga aggggaagaa acttgtccaa 

31741 ggtcacatag taagattatg gcaatgaggg ctgaaactca gttttcctga ctccaagttc 

31801 aatgttcttt tcatcacatt cagcagcctc taataggcac tatttaaaca agccccctgg 

318 61 agctcaggca tagggcaatg gaaggagtag tgcccgcaaa ggagggttcc ctcaaatgca 

31921 tggcaaaggc atttcatgag aaggaaggct gttgagtggg gagagagagt agcaacttga 

31981 tagaattgta tgcaggaact gaagctgaag ctactgcctg caaactgctt ttttgttttt 

32041 gtttttgttg ttctttgttt ttttgtttgt ttgtttgttt gttttgagat ggagtttttg 

32101 ctctatcgcc caggctggag tgcagtggca catgatctca gctcactgca acctccgcct 

32161 cccaggttca agcgattctc ctgcctcagc ctcccgagta gctgggatta caggcacctg 

32221 ccaccatgcc cggctaattt ttgtatctct agtagagaca aggtttcact atgttggcca 

32281 ggctggtctc gaacccctga cctcaggtga tctgcctacc ttggcctccc aaagtgctgg 

32341 qattacaqqc atgagccacc atgcccaacc ttactgcctg caaacttgaa gacaccctca 

32401 gaagtaaaac aagccagaga gagagagaga gactatcaga ggaatacatt ttcactgcat 

32461 tttttctqtq ttcaatgcaa aattcaatgt gtttattaca aatgacaaat gtccttttct 

32521 tttcccctag ttattctcct taggcagaat ctcatttcac cctaccttgc ctcgcatcat 

32581 taacccttgc caacactatt tacttactta aatttaaatg cttaaattag caaagLLLct 

32641 caaaggggca ttttggtcat taacattggt ttactctcta cttttaataa atatgggaag 

32701 tacttggtac acagcaagca ctaaataatt tgttgaatga atgatgatag atataatttt 

327 61 ttgtatctat atctagatag atctatctag atctagatat agatatatat cttagatggt 

32821 aagatcaaga atagcagaaa ttatatttac tattttattt atccatccat tcattcaaca 

32881 aatatttgtg agtgcctatg atgcaccagg cactgttcta gatactgggg gtacagcagg 

32941 aaacaatgtg gtcacagtat ctgcaaacaa agggcttcca tattccagat gtgaatggtg 

33001 gagcaaaaat tattagggat ttggagtcag aatgcctgga tttttgctcc tgctcccaat 

33061 cttcaatgca acacagagca aatcacctga tctctctgag cctctatttc tccacatata 

33121 ggatagaatt ggttttaaaa aaaatacctg tatcaagaat tcattggcca ggcatggtgg 

33181 ctcatgcctg taattctagc accttgggag gccaaggtgg gccgatcact tgaggtcagg 

33241 agttcgagac cagcctggtc aacatggtga aaccctgtct ctactaaaaa tacaaaaatt 

33301 acctgggcat ggtgcagcat gcctgtaatt ccagctgctt gggagactga ggcaggagag 

33361 tcacttgaag ccaggaaaca gaggttgcag tgagttgaga tcgcaccact gcactccagc 

33421 ctgggcaaca gagcgagact ccacgtcaaa aaaataaaaa ataaaaaaga gttcgttgtg 

33481 agaattaaat gtgataatga atgcaaaatt gctctgcagg cacttatgca tctgtacaaa 

33541 tgttattttt tgaatatttt atgaaggtag ttaaactaga aatcagtact agtgtcatcc 

33601 aggccacgtc tgttagtctt tggggaacag atgccacctt agctgagatt ctgactcccg 

33661 ttcttgtccc tattgatcaa acacattgag gatgctctac catcttcctt aaagaatctt 

33721 cgatcagctg ggtgcagtgg atcacacctg taatcccaac attttgggag gctgagccgg 

33781 gcagatcatg aggtcaggag tttgagacca tcctggctaa ctcggtgaaa ctctgtctct 

33841 actaaaaata caaaaaatta gccagccgtt gtggcacatg cctgtagtcc cagctactca 

33901 ggaggctaag gcagaggaat cgcttgaacc cgggaggcgg aggttgcagt gagctgagat 

33961 catgccactg cactccagcc tgggcaacag agtgagactc catctcaaaa aaaaaaaaaa 

34021 caatctttgg tcaagttttt tcttgctctc acaagaaggt gagctgaaga atgaaagaag 

34081 gaagaggagg agactgttaa tgaaatgata gaatgttgat gaaatttaga atttatttcc 

34141 tttctctttg agtttgggtt taggtttaag taactttggt gttggcaaca tttcctaacc 

34201 acctcgttgg ccactgcact gctcttccaa actagggatg tcccaaggca gcaatttaaa 

34261 cacattagct cagaagcaaa agtgaaacaa gggcctccaa atctctaaag gaaaagaggt 
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34321 atgttttcag cattaacaca tcggctaatg tgttcagaat tagaaatgtg gccaccagaa 

34381 gcagtcgtag tagaagggag gctttaggat gacatgtttt tctccatctt tcttccctag 

34441 gtaaagaggt aagcaaaata agcagatatt ttcataacgt actttagcag attctagcga 

34501 aacgaatttg aatcttctca tattgctctc ccaagcttgt taaagctaat gaggcataag 

34561 atgacacttc cctccttatc aggaattcat acacctgcca gcaatttctg caatagctgt 

34621 tttcttttgg gaggaatgat taatgacttg cctaagatca cactgttctg taaaattcaa 

34681 actaaaacct gattctcccc atttctagcc cactggcctt tttacatagc cttaaaattc 

34741 acttaaacag cctgagcatg gtggctcatg cctgtaatcc cagcactttg ggaggctgag 

34801 aggagagggg atcccttggg cccaggagct tgagaccagc cttggcaaca aagtgagacc 

34861 ctcatctcta caattttttt tttaattagc tgggcatggc agcatgtatc tgtagtccca 

34921 gttattcagg aggctgaggc aggaggatta cttgagccca gaaggtgaag attacagtga 

34981 gccatgttca taccactgca ctccagcctg ggtaacagac cgaagcctca aaaacaaaca 

35041 aaccaaaccc ccaaaattca cttaagcaaa cagaaaagta aaattcactt tccctcaaga 

35101 aataacttgc tttaagaaaa tcaaaggaga gagaagagac aaatgtccca tatacagaag 

35161 aattccaaat aatttacgta gctactccat cctcaaggag gtagaataaa actccctgct 

35221 ccttaagtgt gtgctgtgca tagcagcttc tttccaaaga gaacacggcg gagattggag 

35281 cagggaaagg atcaccttat agcagagaaa ccagacaagc gctgcctcta ccaggtggtc 

35341 aaggtcaaca ccaacagtcc taaatcatgt tgacaatatt atcccttgat atgatgtgat 

35401 gagaacagca ctttaactct gggactcctt ccccaaaacc cataactcca gtctaataat 

354 61 gagaaaaaca tcagacaaat gccaatagga gaagcctaca aaatacccaa ccagtactcc 

35521 tcaaaactgc catattcatc aaaaacaagg aaagtctgag aaacaacgct acagctaaga 

35581 ggagcctaag ggaacataac taaatataat gtgctatcct gataggatgc cacagtagaa 

35641 agcagacgtt agataaaaac caagaatctg aatcaagcac agactttagt ttaagaataa 

35701 cgtatcagta tcggttcatc agttgtaaca aatgtaccac actaatacaa gataatgggg 

357 61 aagttgggtg ggaattacat gggaatgctc taaactatct tctcaatatt cctgtaaact 

35821 caaaactgtt ctaaaaatta aggtccattt ttaaaaaatc aaccacaagt tacagtacaa 

35881 attacataaa agaaacaact taccagatgt cactgaacag aaaaaaaaaa . aaaaagttct 

35941 ggtttggtta caggagtcca cagaggagat ggtggcagct tctccttggt tcaggtattt 

36001 ■ ggaaccagct ggggtacgta gccctttggg ccacaatcac aaagccacgt ttacacagca 

36061 gaagtgaaac caacttgcca tcagggagct atttaccatg gtctcctccc acagatactg 

36121 atttattctg tcttctttca cttaaatcta aattcaccag gcacggtggc tcactcctgt 

36181 aatcccagca ctttgggagg ctgaggcagg cagatcactt gaggtcagga gtacaacagc 

36241 agcctggcca acatggtgaa accttgactc tactaaaaat acaaaaaaaa aaaaaaaatc 

36301 agccaggcgt ggtggcagaa ccctgtagtc ccagctactc aggaggctga agcaagagaa 

36361 tcacttgagc ctgggaggtg gaggttgcag tgagccaaga tcgcgccact gcactccaac 

36421 ctggatgatg gagcaagact acatctcgaa aaaaaaaaaa atctaaatgg tacatgaaac 

36481 ggggatcttg cacccttctc tgccatgttc ctacaagtag acagatgtct tttttccttt 

36541 ccaagtgcat cactgcatcc ttccacagca tcttttctct gagggtgaac aatattcaga 

3 6601 tgactggttc actggagcag acaggaaaaa tgccccagtt ccaagacaga caaagcactg 

36651 gctccccgac acccaactat aagacatgac tgggagagat ggaggatatg agataataaa 

36721 tataaagtta ttcttgaccc aaagtcttaa aattcaccaa aaataacatt tgccaaaaat 

36781 aaaacaacat tggattgttt gagtgtagaa atgagtgata taccgcttta cacctctttc 

3 6841 tggaacttaa attttaaatc cgtatatttt tatgtcttta tttagttaga gacaaggtct 

3 6901 tgctctgtca cccagcctgg agtgcagtgg cgtgatcata gctcacagca gcctcgacct 

3 6961 cccaggctca agccatcctc ccacctcagc ctcctgggta gctgggacca tggtcatgct 

37021 ccatcacgcc catctaattt taaattcata tttttaataa acccagaaaa caaaaatttg 

37081 aaagtgctgt ccaggagaga agaaaataaa acccactcca ctttgctgtg tctggcaacc 

37141 caaaccatca catcacatac agggaaaaag ttgtcccttg aaccctcagc ttcagggtgt 

37201 gtgttacctt tttttttttt ttgtgagaca cagtcttgct ctgtcaccca ggctagagtg 

37261 cagtggcgcc gtctcagctc actgcaacct ccgtcttccg ggttcaagag attttcctgc 

37321 ctcagcctcc tgagtagctg ggactacagg ggtgcgtcac catgcctgac tttttttttt 

37381 tttaatttta gtagagacag ggtttcacca tgtttgccag gctggtctcg aactcctgat 

37441 ctcatatgat ccgcccacct tggcctccca aagtgctggg attacaggta tgagctactg 

37501 tgctgggcca agggtgtatg ttacctttaa aggccaactc agtcattacc aagacaaccc 

37561 tcagtactct aggtatgtat ttagtctgaa gaggggattc agaaggtaaa gaggtcagaa 

37621 cagggaaaga attgaagata aatgtttaga ctgtggtctt aaaaactgct ttgaactttt 

37681 ggcaagatga cgtccctcct ccataagaaa accacagaaa caccaggcta aagttcagaa 

37741 caaagaaagt cagcacaaat gccatcctgg atgctgagtc ttatactgat ctctctaggc 

37801 aaaatcacac cttacgctta cattcatgtc acgtgtgctc agctctccct ctactggcta 

37861 tttcttagct tcaacagtta accacaaatg ctactaaatt acttaattat taaaattaat 

37921 tttgagcatt tttaaagctt cctggggcaa ggcccagtgg ctcacacctg tgatccctgc 

37981 attttgggag gccgaggtgg acagatcacc tgaagtcagg agcttgagac cagcctagcc 

38041 aacatggtga aaccctgtct ctactaaaaa tacaaaatct agccggtcat ggcagtgtgt 

38101 gcctgtagtc ccagctactc aggaggctga ggcaggagaa ttgcttgaac atgggaggca 
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38161 gaggttgcag tgagccaaga tcaggccact gcactccagc ctgggcgaca aagggagact 

38221 ccatctcaaa aaataaaaaa ataaagcttc ctgagttcct ttatttgttt gcttgttttt 

38281 gtttgtttgt tttagacagg gtctcactct gtcacccagg ctgaagtgta gtggcgttat 

38341 cacggctcac tgcaacctca acctcctggg ctcaagtgat cctcccacct cagcctccca 

38401 agcagctggg actacaggca tgtgccacca cacttagctt aaaaaaacaa aaaacaaaaa 

38461 aagaaaaaca aaaacaaaac aaaactttag tagagatgag gtctccttat gttacccagg 

38521 ctgaactcaa actcctgagc tcaagcgatc ctcccatctt ggcctcccaa agtgttggga 

38581 ttacaggtgt aagccactgg gtccagcctc tggggttctt tatggtagtc cttggcacct 

38641 gctattggac gcctgaatgc taggcctgtt agatgtttct ccattctctc aggctgaatt 

38701 tccattgagc tcactgaaaa cactgaagca atgagaccgt ccacagagcc tcatcatcac 

387 61 ccccaaccac ctatccggtg ctctgccctc cagcctgttg ctagatgaac tatccatgca 

38821 cccattaaag gccaatccct tctctgtgca ccagattcca tccctctctc taactgaaga 

38881 accatgtaga gtacttcctc ctcttctcta attcaccaaa ctttcactct ccactggatt 

38941 attcccatct gtgaacagat ttgatgtcat tcattccacc ttaacaaaca ctccctggac 

39001 ctccatttcc ctgccaggta ttagtatatt tctctgcact tctttatgac aaaactccta 

39061 ggctgggcat ggtgtcagca ctttgggagg ctgaggcagg aggatcactt gaacccagga 

39121 gttcaagacc agtctgggca acataaggaa acccctgttt ctacaaaaaa taaaaataaa 

39181 aattagccag gcatggtggt gcatgcctat agtcccagct acttgagggg ctgagatgga 

39241 aggatcactt gagcctggga ggtcgaggct gcagtgagct gtgattgtac cactgcactc 

39301 cagcctgggt gacagagcaa gaccctgtct ccaaaaaaac aaacaaaacc ccacaactcc 

39361 tcaaaagagt tgtctctact tactgtctca aattcctttc ctctcaagct aatataaacc 

39421 tattccagtc aagccttcac ccttcccatc ccattaaagc tgttcttgtc aaagtccaca 

39481 atgatcctgg tcaattttca acctttatct ttcttgagcc atcaggagca tttgacctgg 

39541 ttgatcattc cctcctgttt gacaaacctc ctacacttgg ctttcagata accactcccc 

39601 tagttttcat cccatctccc tggaagtgct cctcagtctc cttcactggt tattccctcg 

39661 cttctcaacc tgttaataat aaaatacccc aaggcttcat cttcggtctc tttcctttcc 

39721 acagccaccc tgtttcgtag ctttcaattt cgttcacata ccgatggctc tcgactgata 

39781 tccagtgtca acctctttcg tgtcttcctg ttcactagtc atccgaaaat acaagttcaa 

39841 accaatcccc gccagtccct tgaaacacct ctccacctaa ttttctccat ttcatctaat 

39901 gataactaca tttttccagt cccttggtca aaaagctttg gtgtcacatt tgatgttgct 

39961 gtctgccttt catattccac atctgatctg tcaaaaagtc ttgttgaaat cttcaaatta 

40021 tattcagaat ctgaacactt ctcaccacct tcactgctga ctaccccgat ttgagtctca 

40081 ataatctctg gcctcattca gtggttccta agttttgctg cacgttggaa taacccagga 

40141 tcttttaaac atgctaatgc ctgactccca ccccttgata ttctgattta ataggtgtgg 

40201 gatgtaatct gggcactggg aatttttcac tgctctccag gtgattccaa ttgcagcaaa 

40261 gtttgggaat cattggcctg gctatggtaa ccgccaccca ccLgatctcc ccacttccac 

40321 accaaccccc tcccacagtc tattctcaat gcagcaaata gacatgcttt taaattacag 

40381 atcagatcca ttcaattctc tgctaaaaac accagtggct ccccatctca attagggtaa 

40441 aagccaaagg cttttcaatg gcccacaagg tgttacatga gctgcactgc cccctgtccc 

40501 atgcgccact cctctgacct ctctttcaca ttgcctcacc cactgctccc gtgatgtcag 

40561 cctcctcagt ctccttgaac accccagaca tcctctcacc taggactttt tttctttttt 

40621 gagatggagt ctcgctctgt caccaggcta gagtgcagtg gtgtgagatc taggctcact 

40681 gcaactgcca cctcccaatt ctcctgcctc agcctcccgc gtagctggga gtgcaggcgc 

40741 gtgccaccat gcccagctaa tttttgtatt ttcagtagag acagggtttc accatgttgg 

40801 ccaggatggt cttgatctct tgaccttgtg atctgcctgc ctcagcctcc caaagtgctg 

40861 ggattacagg cgtgagccac tgcgcctgac cttcacctag gacctttgca ctaaccatta 

40921 tcttagccag aaatactctt tcccacatat ctacacatca agtattagct caaatatcac 

40981 attgKcaata aggcctactg taacccctat ttaaaattat acctttaacc acctatctcc 

41041 ctcccctctc tgagataaaa cagggaagag gtagaggatg gtgccattag ataaggaaga 

41101 gcagatctgg agtaataata gagttctgtt tgggacatgt taagtttgag atgcctatta 

41161 gacacccaaa tctaacaaga tatcaaatag gggaaaagtt ggggttggac tggaaacctg 

41221 ggagcaaaca cgtggtgctc tggggatcac cttggagaga caaggctgag tgacccccat 

41281 ggaacactag tattctgagg tgaggcagag acagagttca caaaagaaag agaagagaaa 

41341 ccaagagagt tggagggccc tgtcataaaa gatgttcatg gaagagttga caaaaagagt 

41401 caaactctgt aaaatacttg aagagattta ttctgagcca tgattgacca actgaggcaa 

41461 gacaggcaag ctccaaaatt ggggctttgc ctgggagggt tcttggcttt gcccaggaaa 

41521 caattcaagg gtgagctgat ggtgttaaat agcaacttgc attgaagcag cagtgcacag 

41581 ctgcagcaga gggactgctc cttgccgagc agggctactc acaggcagtg cccagaagag 

41641 cagctcagag gcagttctgc agtcatattt atacccactt ttaactgtat tcaaattaag 

41701 gggcaattta cgcagaaatg tcaagaatga ggatggtaac ttccaggtca tcaggtcatt 

417 61 gccatggaaa ggggagggta atgttcaggt gttgccacgg caatagtaaa ctgacatggc 

41821 atactggtgg gcatcttatg gaaagttgct tccacccctg ccctgtttca gctagtcctc 

41881 aacttgatcc agtgtccaaa ctctgcctcc agaacagagt cccacttcct acctcacatg 

41941 gtctgtgaca gagccccagg ggatcctgag aacatgtgtc caaggtggtc aggttacagc 
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42001 ttgattttat acattttagg gagttataag acattactac atgtaagatg tatattggtt 

42061 tggtccagca tggaggaaaa gttaaatatt aaatctgaat tcaattgaac ctggacacaa 

42121 acaatagtca ccaagtcctg gaacaagttt tgtgagtccc ttgaggcttt catccagcgc 

42181 tgtttcagag aaatctctat ttcaatctat tcctatacat tagttattga aaaacaatag 

42241 acaatagcaa aaacaagttg acctttttgt gttccttgag cctggttgtg aagggccctt 

42301 gtgactgggc ctcatgccaa acaacttgtt acaaaaagag ctagggtccc aggcccagcc 

42361 gaagcttcag gagacctatc ctcatctgtg caaggaggag tggccaactc tggagcccag 

42421 gctgttgctt cctggtctgg tggtgaatcc tccatagtct ggtgagtgta gtgcccaact 

42481 ctggagccca ggatgttgct tcccggtctg gtggtgaatc ctccatagtc tggtgagtgt 

42541 agtgtccaag tctggaaccc aggctgttgc ttcttggtct tgtgataaat cctccatagt 

42601 ctgatgggtg ggtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtataca catatacttt 

42661 cccttctacc cttcccattg caatttgctt attatatctg cattgccatt tacatgggat 

42721 aaaggtcgtt tacccttaaa ggtattgtgt atgtgtcttt tcttctcccc tcacgcatat 

427 81 cccacagaga acaaccagaa aggtgggaca actcaaagca ttgatgtggg gtcagggggc 

42841 aacactttca ggtcacaggc agattcagtt ttctctgact ggcaattggt tgaaagagtt 

42901 attatctata gacctgaaat gaacagaaag gaatgtctgg gttaagacac agagttgtgg 

42961 agactaaggt tttatcatgc agaggaagcc tccaggtagc aggcttcaga gagaatagat 

43021 tgtaaatgtt tccttttttt tttttttttt tttgagccgg agtctcactg tgtcgcccag 

43081 gctggagtgc agtggtgcaa tctcggctca ctgcaagctc tgcctcccgg gttcacgcca 

43141 ttctcctgcc tcagcctccc aagtagctgg gactacaggg gcccaccacc acgcccggct 

43201 aattttttgt atttttagtg gagacggggt ttcatcatgt tagccaggat ggtcttgatc 

43261 tcctgacctt gtgatccgcc cacctcggcc tcccaaagtg ctgggattac aggcatgagc 

43321 caccgcgccc ggccgtaaat gtttcttatc agactttaag agtcggttct gtgctctatc 

43381 agccttaagg tctctgtgtt gatgttaaca ctggttagca gctcctgaat tctaaaaagg 

4 3441 aggagggaat aaggaggcat gtccaacccc acttcccatc atggcctgag ctagtttttc 

43501 agtttaactt tggaatgccc tgggctgaga attattttcg gtttacagag gagaacttat 

43561 gttcattggt ataggatgac aatgaggtca gacaaatacc actgaacttg gcaggctggg 

43621 aagttacaaa gcttagcaag gccagtttca gtgtcatagt ggggtcaaag cctcgttgtt 

43681 gcctcaattc tagtagggta catattcctg tctttattaa tgggagaggt tcttaatctc 

43741 cagcccatgg acaaaggaga tcatagatgg gtttcaggaa aacatcccaa gtcctgcctc 

4 3801 caaattttgc aaaattttgt gcctgtgtat ttttctggag aacgtaaacc aaacagtgtc 

43861 tgagacaggt ctcagtctgg aggtttattc tgccaaggtt gaggacacac ccaggaaaaa 

43921 gagacataag ttatcatgga atctgtggcc tgtggttttt ccaaagaggg ttttgaggac 

43981 ttcaatattt aaagggaaag agcagacaga aggggaaaga ggaacaacta tgcattcatt 

44041 tcacactcag taaatctgcc ttttacagaa gacaaagtaa acatagagga aggagtcaaa 

44101 tatgcatttt tcttggggtg gactgaaggg tgatttctag tcttgtcctt gtcccctacc 

44161 tactgtaaat ttgcatggtc agggtggaat tcaacagaag tgttgtaagg taaagactct 

44221 gccactcaca aggaatttcc ctgtgagcaa ctcctctggg aggccaccta gggagatatg 

44281 tgggcttctg tctttgcagc tgtttaggaa cagaaggaag gcagtttttg cgtgactcag 

44341 ttcacaagct taactttttg ttgttgttgt tgttgttgag acagggtctc gctctgtcgt 

44401 ctaggctgga caacagagaa tgatctccac tcactgcaac ctccacttcc gggttcaaga 

444 61 gattctcctg ccttagcctc ccaagtagct gggattacag gcgtgcacca ccacgcctgg 

44521 ctaatttttg tatttttagt agagacaggg tttcaccatg ttgaccaggg tggtcttgaa 

44581 tgccagacct cagatgatct gctggccttg gcctcctgag atgggcatag tgctagtatt 

44 641 acatgtgtga gccactgcgc ccggccctaa gcttaacttt tttctttggc agagtttagg 

44701 gtcccgagat gttattttcc ttccacaaga gacatccaca gctttcacga tggttaccga 

44761 tcatatgtgc cattcatcta agaccaaggt gaccacctgg tcccaaagcc tgcacctcgg 

44821 attcttactc gattcccgcc tggtctcttc caggccctca gttagtttcc acaacatggg 

44881 agggttcctc ctccgtggtc tcgcccattc gcgttcccac ccaccctccc cggatgtgac 

44941 gaccctgggc tcccactccc ccccatgaga gaccatccac ttccaccctt tcacttaggc 

45001 ttcctggcgc agcctctatc caggaactct gtcccatagc tgtactgccc aggtcgagct 

45061 ttctaacccc aagatgccac tccgttcaga cgttcccatc ctccatccct gagacccggg 

45121 gcaggatcgc agcctagggg cctcacRggc cgcctgctac cctgttagtt gcaacacggg 

45181 gcggggcgtg gctttgtgca cgtcggcttc cgggaagagc tttacgatac attgaccgac 

45241 attttacgac aggcgggatt gttttgtggc tgtcagcttt cYccgtggtc tgagtttgtg 

45301 gctgcatttt tatctctggt ggctctgcta cggcggcgca gaaatgaggc agaagcggaa 

45361 aggtgcgaaa ggggaaggag atgggggaaa ggggtggtcc gaaaggggcg aacgcccagg 

45421 caatcaaatg ctgaaccgaa cttttaccgc gagaatccgc tacccagtcc agtcgccccg 

45481 ccacctaggg tcagtgtgtc cattctggtc cccgagcctt ctcgtttttc ctgttttgct 

45541 ttttgaagca ccctaccctt cctctcttcc tctttggcag tcacgtggtg tcttgtttga 

45601 atggcaggga aaccattatt ccaatatatg ccctccgagg agttaacgtc gatttaacgg 

45661 gttgtaggac ttttcatttg ttaagatttg tttcacgaca catttgacgt gttagagagt 

45721 ttcttttaaa gccttatttt aagatattaa aaaaacctta attatccttg gattcagtgt 

45781 aagagttgtg catgcaatta ttcccatttt tattacatca tgagttagca atgaacaatt 
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45841 ccacctttgg tttttcaaca ggacggattc aaatttgaaa tattctctca caggttctgt 

45901 tcctatttct gtatctcaca ggttctgttc ctatttcatt tgaccagctg gtctaatcat 

45961 ggtaattgtt gtcagaataa tagctaccat ttattgagct tgtattttga agtcggtatc 

4 6021 atttccattt tacagatgaa gaagctgaga ttcaaagagt taagtaactt tcccagacca 

4 6081 aaattatact tagatatgga gctggctggt caggctgtat ctgtgacctc aaaaaacata 

46141 aaatgaaaaa taaatacatc gttaatcMct atgcttcttg acattaactg ttatttttct 

4 6201 ttcttccaag caacaattct aagtgtccaa gttaccttag attaagtctt gtcatatttg 

4 6261 catgccctaa attaatttgt tatctcactc taacttgctc cttctcaatc taccaggatt 

4 6321 cttttagYtt cttctgtcac caagtcatgt tgattcttac ttagaaatta cttcttgata 

4 6381 tagtttattc ctcttcattc tggccgccaa cctcctaacc caccttcatc actaactaat 

4 6441 atttgaatgg tatttcagct ttcatataca ttatttattt gagtcttaga atggttttgt 

4 6501 gaaatttgga tgtattaccc caattttaca gacaaggaaa tgggcactca aaagttgttg 

4 6561 atcttggcca ggcgcggtgg ctcatgccgg taatcccagc actttgggag gccaaggcgg 

4 6621 gtggatcact tgaggtcagg aattggagac cagcctgacc aacgtggtga aaccctgtct 

4 6681 ctactaaaaa cacaaaaatt agccgggcgt ggtggcaggt gcctgtaatc taatcccagc 

4 6741 tacttgggag gctgaggctg aggctgaggc agaggcggga gaatcacttg aacctgggag 

4 6801 gtggaggttg cagtgagccg agatcgcacc actgcactcc agcctgggtg accgagcaag 

46861 actctgtctc aaaaagaaaa agaaaaagaa aaaaaattgt cgatcttttc cgttaccaca 

4 6921 cagccagtag gcattgggac tggatcctgt cttagatttt gtgggttttt taaaacaatg 

4 6981 ttgcctttca agagccattt gagtagtatc catattatat gtattctttt tcactttctg 

47041 aatcattctt tgtaccgata atattcttac cttccttaaa cgttactttc aactttttat 

47101 ttccaaagtt gagaatatgc aattcttcac tgtctgttgc attccatttc aatgcctttg 

47161 cctagtggcc agggtcccct gtaatctgtt ctttattaca ccctgttaga aagctggatt 

47221 ttagttaggc tgtgctgttt cacacaaaga taggtaattc ctatgcctga cattatttct 

47281 tctggattgc cttctgtttt gcatgtccaa atcttgccct atctttaaag atccctgtgc 

47341 catttccttc atgaagtctt ttcctactca tttactcttc cagttcttga tacactcaag 

47401 agctatacaa gtggtttcca aatcttatct gttcttatca gccgatcccc ttttaccctc 

47461 caaataaatt cttacacaga acctcaatat ataaacagct gaaagcagag ctgctgtggt 

47521 ggaaataaag agacctcctc ctttttttcc tccatgcctt attttcctct tcccttctcc 

47581 cttcctctgc catgaccacc accagtccta cctctgttgc atggcccctg aggcaccttt 

47641 gtagaactct ggaagtgctg agttacatgg tttggaaagc atttagcctt tgagtataga 

47701 tggactttat ttctctcaaa aattgttgtg tatatgttag ccttgcctca ttaaattaga 

47761 tgcagtgtgt tacagtgaaa aaagcatgga agttgtggtg aaaagagctg gatttgagtc 

47821 ttggctctgc tgtttacgat agcttgttaa cttgccttag cctcattttt cacatctata 

47881 aaatgggata acagtaccta ccctccgtgt tactgtaagg gttcaaatga gataaagtaa 

47941 atgtgtgggc tccataattg ccagggcctt ccatccacgc tttcattatg ttttagcttt 

48001 ctctgcagtg tgagtcctca gccccctggg ccacggacca gtaccggttc atggcctgtt 

48061 aggagctgag ccagacagaa gaaggtgagc agcaggcgag caagtgaagc ttcatcagta 

48121 tttgcagcca cttcccatca ctcatgttac tgcctaagct ctgcctcctg tcacatcagc 

48181 agtggcatta gattctcaaa ggagtctgaa ccctgttgtg aaccgtgcat gtggtgggtc 

48241 taggttgtgt gctccttatg agaatctaat gcctgatgat ctgtcactgt ctcccatcac 

4 8301 ccccagatgg gagtgtctag ttgcaggaaa acaagctcag ggttcccact gattctatgg 

4 8361 gaacagaatt cgttatggtg aattgtataa ttatttcatt acatataaca atgcagtaat 

4 8421 aaaaataaag tgcacaataa atgtaatgca cttgaatcat cctaaaacca ccctgctccc 

4 8481 ctggtctgtg gaaaaatcat cttccataaa accagttcct tttatcaaaa aggttgggaa 

48541 ccactgctct acaggtgaga agcttgcttg ctgacagctt ccaagctgat atcccacagc 

4 8601 ttttgtccct ggaaaaagac tggcattttt taccttgaag aaaagagctc agattggcct 

48661 ggcttagaga tatgcccatc cttgaaccaa tccttgaact gtgcttgagt ggatggcgtc 

48721 atacaagaac ctgtcagctt ctgctgttgc ctttggatga gaccgagagc cccagttgcc 

48781 agaagttcat gatggtggac agacaatatt ttagaaatgc attaagattc ttaggcagat 

48841 acctgggttt gagtttcaac tttactcctt gatagcatct tcatgtaaca ggcacatatt 

48901 cttgttctgt ctacctctgt tctgagaatt acctgaattc atgcatctaa atgttcttta 

48961 tgaactctaa agtattatgc acatattagt aattacacct ctggagataa agggaagaga 

4 9021 attactttgg tcctttcctg tgtgattaag ttctttctcc tcacattgtt tctactYctg 

4 9081 tatcctttcc tgtatttatt tttaataatc tcatagagta tgtgcttcta aagggagctt 

4 9141 ttcccatata tatgtaaagc cacgtttgtt ttaaagatac aaaagatatg atcaaattgt 

4 9201 aaacagtaga gatttagcat cttctgaact tgagctattc atttggcata tttgtatggt 

4 9261 ttcggatgga aaactgctac taaatcagat tttaaaatct tgattgaatg atacgatttt 

4 9321 tgacattgtt gaatattaac tttccaggag atctcagccc tgctgagctg atgatgctga 

4 9381 ctataggaga tgttattaaa caactgattg aagcccacga gcaggggaaa gacatcgatc 

4 9441 taaataagta agtggatata aagagagagc aagcttgttc ttaggtagca gatctctttt 

4 9501 atgaaattat ctagtactac ttgttttcca gctcaagttt tattttagtt gagaatttta 

4 9561 gctttctgcc ttgccttgtc attcatttct tttttttttt ttgcaagttc cgcctctggg 

4 9621 gttaacgcca ttctcctgcc tcagcctcct gagtagctgg gactacaggc gcccgccacc 
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49681 acgcctggct aattttttgt atttttagta gagacggggt ttcaccgtgt tagccaggac 

49741 ggtctcgatc tcctgacctc gtgatctgcc cacctcggcc tcccaaagtg ctgggattac 

49801 aggcgtgagt caccgcaccc ggccgtcatt catttcttga tggcaaaatt gtctcagctt 

49861 aagaatacta atagcttcta acatttattg ggcacttact gtattccaga cactgttgct 

49921 aatgcattgt atgtattagc tcagttaatc ccctcaaaat tctgtgagat aggtactgat 

49981 accagccaca cttgacaggt gggtaatctg agtcagagag cagttaaata atttgtccag 

50041 gggtaagtaa ctagtaattg gcagagctgg gatttggaat taggtcagcc gtctccagag 

50101 cccatgttct tctgtgctgc atcgctgcac atcctacttt ccttccctac tgtctcagct 

50161 gaagcagcag ctttctgtca ggaaaaccct ccctttgggg cccctgatct aatccaaccc 

50221 tcccctgctg tagaacctct ttgacacatt catccacttc ctctccataa tcattttcag 

50281 cttggacttt cctacagtct caaataagag ttgtctttca tctctgcatc tatttttttt 

50341 ttcaaactat aatatatatt cactctatgt agatcttccc taatgaaagg gttctctccc 

50401 caaccccagg tcgtctaatc atcacataca ggaatgacct tttcttagac tctcttctct 

50461 gatacttgag atgctgcaat attttacccc ttccttgaat cctcatgctt agtaaatatt 

50521 tgctacagga ataagtaaat atcctctttt gactataaag ttattgtatt ttactgattt 

50581 ttctaccgtg tctttagaag tttttctttc tttttttgtt ggttataccc ctgattatac 

50641 aaatctttaa tttctagtga acatttctac atcttatata tatttgttgt tctctttaat 

50701 ttagcacatc taaagctaaa gctgccatcg tccttaaaac tcgcccctcc tcagatgtca 

50761 ccagcctatt actggaaggt ctctcagtct ggcttcaacc tgcttttcca agtttttaaa 

50821 cctctgtacc tttacattga aaatcttctg ctttctgtca agttggtctg cttgtatttc 

50881 ttaaaatata tgcattcctg tttttgcatc tgcacataaa tcagcctgtc tgccctccat 

50941 gtcatctttg tgtctttagt gactgagacc tactctttct tctgtgctca tttccatgaa 

51001 tatctcccca gcacacatca gattcctccc tgaactctgg aagcattcat tgttggtatg 

51061 tgtgtgtgtt ttatgacata tgttttaatc atttatacat taattttctt ggtacgcctg 

51121 tctgcaaaca tgtaaacata gaacatgaaa ggactgtgcc ttgcacattt gaacatagca 

51181 ggtattcagg cagcctcttg tacgtgttag gtcctcagtt aatatttgtt gacaaatatg 

51241 tgagcacttc acagatattt ctagatggat taaagttaga agacagggtg actgttaaga 

51301 gtttggctag gaggcacaaa gaaaagctgg taaaagtttt ttttaaactt tcaaaaatat 

51361 gtactttatt tcctatttgg ttctgcatag ccatccttga tttttttttt aattgttgtt 

51421 gttgttttgt ctaagacagt agttgtcaac caggggtgat ttcctcctca ggacatttgg 

51481 caatgtctag ggacattttt gagtgttaca acaggaagac aagaagaagg ctggtagtac 

51541 atagagacca aggatgctgc taaacatccc tcaattcgct ggccacctgc cacagcaaag 

51601 aactgtgtgg tcccaaacat cagtagtgct gaggttgaga aaggtggagt cagaatagtc 

51661 ctttgaagct gcagcactat tctgaactag atctatattg tgtcatcgtc cagggaccct 

51721 tctccctcca tagtcctctg gttttctttc tggtactcaa gggtgggaag tagtcattca 

51781 ctcgtgaatt ttattcttct atttccctga tagtcacaga tcttaaaact atccttgtca 

51841 cgtggtgaag ggattggaga gtgacagttg ttaatgatgt tgtttataga tcatcctctt 

51901 ggtagcttgt ccttaaataa ccgtRatctt gataatgtga gatgctttac tttcagggtg 

51961 aaaaccaaga cagctgccaa atatggcctt tctgcccagc cccgcctggt ggatatcatt 

52021 gctgccgtcc ctcctcagta tcgcaaggtc ttgatgccca agttaaaggc gaaacccatc 

52081 agaactgcta gtggggtgag tgattcgact catgaggtat cgacacactg ggtatctgtt 

52141 Ytggcagaag tccctgctcc atgtgacRcc cgtgtagtga ggtagaggtt ggggattctg 

52201 aactaatgaa gtccctgtat taggatatgt gctggagtgg agacctgtgc tgaatgcaag 

52261 gagagcagag agagaaaaaa atataattgt gattaaagga gtagatgatg ttttgacctc 

52321 tttaaaaata tagatttagg cttggtgaca atcataacag gataaatgtt ggccatctta 

52381 agctggtagc tttcaatata attgatgctt atgaaaggta cttttgaaag gtagttaatg 

52441 gtgcaaagta gttctgtata tggtgattat gagcactgtg gaagtccaga caacagcgat 

52501 gtccctgcag caaaaggggc tggggaaggc ccagtagaaa gcacacagct tgcttgaccc 

52561 ataagccaag aagagcaggc tctaagtgtc atgggagcaa ggacacaggg gctaaggcac 

52621 agaggtagaa atgacaggaa gggcgtggtg cacgcaggaa atagcgagta cgccagactg 

52681 aagggtgtta acagttcagg agagtaggac tgtgcgttga acaggtagtt tgggatcagc 

52741 ttgtaaaaat cttaagtgtc tggcaaagca gctagataat aggaaacaaa agttttgaaa 

52801 aaggtagtaa tatgaacaaa gtagattttc agaaaattaa tttggataca acatgcaggg 

528 61 tagtttggag acaaaaagac tagcttagat gctcttggta tgatgtcagt atgcagtgtt 

52921 aaggacttga gttaggatgg agactctagg atgaaaaaag atgagtgtga gatacatgca 

52981 cagaaattga tagaattaag accctcctac catatggtcc ccagcaccaa attctgttcc 

53041 taatgtgatc ccacaagttc attctgttat ttgaattggg atattatatt ccaggcagct 

53101 tgtgttgact tggtgaccat atctcctgga tcaccttacg tttgtggcta attaaaccct 

53161 aaaattttta catacaatgt accaccaaac ctcatatcct cttctttgta cttaagtgtt 

53221 tttgttttgt tttgtttgtt tgtatctagt gtaaaaagct tgttagccag gcacggtggc 

53281 tcacgcctgt aatcccagca ctttgggagg ccgaggtggg ccgatcacaa ggtcaggtca 

53341 agaccatcct ggctaacacg acgaaagccc atctctacta aaaatacaaa aaaaaaaaat 

53401 tagccgggcg ttgtggcggg ggcctgtagt cccagctact ccagaggctg aggcaggaga 

534 61 atggtgtgaa cccgggaggt ggagcttgca gtgagctgag atcatgccac tgcactccag 
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53521 


cctgggcgaa 


aaagcgagac 


tccatctcaa 


aacaaaaaac 


aaaaaaactt 


gttaagaaaa 


53581 


actaatagtc 


catgcccctc 


acctcccttt 


ttctacccta 


gggcaaccat 


ttttaactct 


53641 


tagccaattt 


ctttagcatt 


aacttccata 


tccataaata 




ctttacataa 


53701 


tagataagtc 


ttgactttct 


tttttttttt 


ttttacctga 


gacagtcttg 


ttctgttgcc 


53761 


caggcaggag 


tacagtggta 


cgatcttggc 


tcactgcaac 


ttctgcctcc 


caggttcaag 


53821 


cgattcttgt 


gcctcagcct 


cccaaggagc 


tgggattaca 


ggcatgtgcc 


acaatgccca 


53881 




gtattttcag 


tagagacagg 


gtttcactat 


gttggccagg 


ctggtctcga 


53941 


actcttgacc 


tgaggtgatc 


tgcccgcctc 


agcctcccaa 


agtgctggga 


ttacagacgt 


54001 


gagccactgt 


gcctggccga 


ctttttagag 


ttaagcatta 


tgtgtgggct 


tgccattaaa 


54061 


gaagacagaa 


acttagcaac 


ctttcagcct 


gactggcaaa 


ccgaggcttc 


tgtgatacca 


54121 


ccctctctgt 


ttcttcctct 


gtictctigatzc 


atttctgtcc 


tatcttggct 




54181 


gcttacctct 


aaaagttgat 


gttttctgtt 


gtcctgggct 


cattgtgttt 


ggtattttac 


54241 


atgctttctt 


tggtaacctc 


atccatttga 


tgattttagt 




gctgactccc 


54301 


agcatgaacc 


attccctgag 


cttcagactc 


ctgtcagatt 


gtcagttagg 


catctgtcct 


54361 


ttgactgcct 


gagaacct.cc 


tgaagtatag 


cataaccaaa 




gacttgctta 


54421 


ttcacctttc 


ctgtccatgt 


tagttcatgg 


taccaccgtg 


cactcagttt 




54481 


tgaaatgcat 


tttccgttcc 




ggtaagaatg 


ctttggcctt 


tattatttct 


54541 




atttcagtca 


tttccttacg 


catctatttt 


gtcctaccta 


catgaaatgc 


54601 






agtgatctat 




agaaacacac 


agctggtggc 


54661 


cattccctat 


ctactgccat 


tctcacctcc 


atgcttttat 


tcttgctgtt 


tttctgatgg 


54721 


tgctttgttt 


attttataca 


gtagttttag 


gtttatagag 




tatacacttc 


54781 


ctctagcacc 


ttcaccccta 


ccccagtttc 


cctattatta 


atatcttgca 


ttgtttggta 


54841 


catttgttag 


aattgatgaa 




tgcattatta 


ttaaccaaag 




54901 


cattagaata 


cactctgtgt 


tctacattct 


gtaggttttg 




atgtcatqta 


54961 


tccactatta 


ctgtgtcata 


tgaaatagtt 


'tcac'taccct: 


aaaatctcct 


atttgtggcc 


55021 


gggcgcagtg 


gcttatgcct 


gtaatcctag 


cacti: tggga 


ggctgaggca 


ggcagatcag 


55081 


ctgaagccag 


gagtttgaga 


ccagcctggc 


caacatggtg 


aaaccctgtc 


tctactaaaa 


55141 




tiagccgggca 


tggtggcggg 


cgcctgtaat 


cccagctact 


tgggaggctg 


55201 


aggcaggaga 


atagtttgaa 


cccaggaggc 


agaggttgcg 


ttgagccgag 




55261 


tgcactccag 


cctgggtgac 


agagcgagat 


tctgtctcaa 




aaacaaacaa 


55321 


acaaacaaaa 


acagctccta 


tttgtccctt 


tccctctgca 


tgttctagac 


gtaacctgac 


55381 


ttccactgat 


tgttttattg 


tctttaataa 


agtttgcttt 


ttccagagtg 




55441 


taattgqaat 




ttccacttag 


caatatgcat 


tgaagtctgc 


catgtctttt 


55501 


tgtgacttgg 


tagctcattc 


ttttttttta 




gataatccat 


tgtacggatg 


55561 


taccactatt 


tgcttattca 


ttcacctatt 


gaaggacatc 






55621 


gcagttttaa 


acaaagctct 


gtgaaggtta 


ttgtgtccac 


ctacattttc 


agcttacttg 


55681 


agtaactgtc 


aacaagtgca 


actggtagat 




gactatgttt 




55741 


aaaactgcaa 


actcttccag 


catggctgca 


ccattttgca 




cagtgagtga 


55801 


gcactctgat 


gttccacatc 


cttgctaaca 


cttggagatg 


tcggtgtttt 


ggattttatt 


55861 


taattaattt 


atttatttta 


agacagggtt 


ttgtcctgtc 


actgaggcta 


gagtgtggtg 


55921 


gcatgatcac 


agctcactgc 


agcaacctcc 


caggttcaag 


ctatcctccc 


acctcagcct 


55981 


cccaagtaac 


tgggacaaca 


ggcatgcacc 


accacaccag 


ctaatttttg 


tgttttttgt 


56041 


agagacaggg 


tttcaccatg 


ttacctagga 


tggtcttgag 


ctcctaggct 


caagggatcc 


56101 


tcccagcttg 


gtctcccaaa 


gtgctgtgat 


tataggcgtg 


agctatggtg 


cccagccagt 


56161 


gttttggatt 


ttagccattc 


tcatagttga 


acagtggtat 


ctccttgttg 


tttagtttgt 


56221 


aattccctaa 


tgacatgatg 


ttgagcatct 


ttccgtatac 


ttatttgcca 


ctgtatatct 


56281 


tctttattga 


gatatctagt 


cacatctttt 


gccctgttct 


taattgggtg 


ttttcttact 


56341 


aggttttaag 


agttctttgt 




tggaagtggt 






56401 


agatattttc 




tggcttgtct 




tttctcagag 


cacaactttt 


56461 


aaatatagac 


aattaggtcc 




ctgagtttgt 


atttgagttc 


gtgtttgtgt 


56521 




ccatggttgt 


taacttggat 


acattactgt 


catctaagcc 


tcagacctca 


56581 


cttaagtttc 


accagccgtt 




cccacagaac 


ctagttcaga 


atcacctgtt 


56641 


gcatttaatt 


gtcatatatc 


tttagtctgg 


acatttcctt 


tgtctttttt 


ggactccgtt 


56701 


atcttaacgc 


ttttgaagat 


ttctggcaag 


ttattttgta 


gcacgtccct 


cagtgtgggt 


56761 


tcatcagctg 


ttttctcatc 


atgagattca 


ggttacgcgt 


ctttggcccg 


tgcctcatag 


56821 


aagcagcact 




cgtcatctcc 


catccagtgg 


tgcgcaggtt 


tggttttcct 


56881 


atcactgatg 


ttcctcattt 


tgatcaaggt 


gctgtccacc 


agacttaccc 


tctgtcaagt 


56941 


tatttttttc 


cactttgtat 


taagaagtgt 


tgtatggaga 


aatactgaga 


aactaggtgg 


57001 


atatcctgtt 


tctcatcacg 


tacccagttc 


actcctttat 


ttgtgtgaag 


gaattaatgg 


57061 


tttcctattt 


ggtgggttat 


catctgttac 


tatttatttt 


gatgcacaaa 


ttattgtgga 


57121 


cttgaccagt 


gggagccttt 


tcaagctgat 


ttctatgtct 


ttttaaaatg 


tcctcatcat 


57181 


tctttgagca 


gtttctagct 


ttctagcaca 


ataaaatgtt 


ccaggcttgg 


ccagacatgg 


57241 


tggctcacgc 


ctgtaatccc 


agcactttgg 


aaggacgagg 


tgggtggatc 


acctgaggtc 


57301 


aggagtttga 


gatcagcctg 


gccaacatgg 


caaaaccctg 


tctctactaa 


aaaaaaaaaa 
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57361 tacaaaaaat tagccaggca tggtggcaca tgcctgtaat cccagctact tgggaggctg 

57421 aggcaggaga attgcttgaa cccaggaggt ggaggttgca gtgagccaag atcacgccat 

57481 tgcactccag cctgggcaac acagtgagac tccgtctcaa aaaggaaaaa aaaggaaaat 

57541 gttccaggct tatcttagac tttctttgct ccagccctgg aatcagccat ttccccaagg 

57 601 agccctggtt tcttttagtt ggagaaggat atttagatac taagacctgg gtcctaggtg 

57 661 tgcttactgc tgttagggtg ttgctgctgc cagactctct cagtggacca agcgaggaca 

57721 tatatatgta tagctgcata cataacatgc acacatacat gtaacacatt tccatttgta 

57781 tttatttatc agtctaccat atgttgaaca ctctgattgg ccacaatacc tttaattcca 

57841 cccagcccac agagttcatt ctgcttctct ctctttccat gtttatagct acttctctga 

57901 tagtaagaag cctggcttgc tttcactttt gtaaatggcc agatttgacc aagtgccctg 

57961 gatgtaacca atcttgcgtc tctgccactg cctcctgtcg tcacctcact gaggctctgt 

58021 cagacccctc tgaggttatt tacacccaga ccctgaaaca tgaagctgct agtttaatag 

58081 tacctgctgc aaatattgag atccagtgtg ttcatgaggc gtttgagtca caaaggttag 

58141 gtttatatat aatttcatag aattgcttaa agaaattttt ttcttacagg ctgtttacta 

58201 agacaatcag agagagaaag actaagaatc actttggctt taacagttaa tttgttattt 

58261 tgtacttaat ttattgtaaa atggaatata acttcacata tatattacat acggacaatt 

58321 taaagatgat taatattgaa cagagatcac tcttgtaccc attgcccagc ttaagaaata 

58381 cagcctcggc cgggcgcggt ggctcacgcc tgtaatccca gcactttggg aggccgaggc 

58441 gggcggatca cgaggtcagg agatcgagac catcccggct aaaacggtga aaccccgtct 

58501 ctactaaaaa tacaaaaaat tagccgggcg tagtggcggg cgcctgtagt cccagctacc 

58561 tgggaggctg aggcaggaga atggcgtgaa cccgggaggc ggagcttgca gtgagccgag 

58621 atcccgccac tgcactccag cctgggcgac agagcgagac tccgtctcaa aaaaaaaaaa 

58681 aaaaaaaaaa aaaaaagaaa tacagcctcg tcaatacctt tgaagcccct ttttgccact 

58741 ctctggttgc attcctctcc ttccttccga gggataagca ctctgtggag ttttatatta 

58801 attatcctat aatagctttt ttattcatct ttcttttata gattgctgtc gtggctgtga 

58861 tgtgcaaacc ccacagatgt ccacacatca gttttacagg aaatatatgt gtgtaagtat 

58921 ggtgatttta ttaaattgta tgtatgtttt aattaagcta aatatgcccc ctctagccct 

58981 tagtcagtac atcctggtaa tgtttaaaac ttcagcttaa tagatttata gattactcct 

59041 ttcaaacaag caaccattgg tagatatttt agtgctttaa aattggaata tataaggccg 

59101 ggcaaagtgg cttacgccta taatcccagc attttgggag gctgaggcag gtggatcacc 

59161 tgaggtcagg agttaagacg agcctggcca gcactgtgag actccgtctc tactaaaaat 

59221 acaaaaatta gccgagcgtg gtggcatgcg cttctagtcc cagctactta ggaggctgag 

59281 gcaggagaat cgtttaaacc tgggaggtgg aggttgcagt gagccgagat cacgccactg 

59341 cactccagcc tgggcaacag agtgagactc catatcaaaa taaataaata aaattagaac 

59401 atatgaatat tttaatttat tgcaatatac aattctaaaa atgtaggtta tggaactcac 

594 61 aacagtagac attgggatat gcaactcaaa acagcacatt ctgttaaact cataaatgaa 

59521 acatgggaat atgagctgcc taaattccac gtcagaaatt taaaatgaat ttggatcaga 

59581 aacatatcaa aataaaaaat tatccttata cgtaaccttt agatttctca aactcaccta 

59641 tttgaaagaa ttagcggaag agtttgcatg attctgggta ggaagataaa aggggggaaa 

59701 gattttcttg agtgtggttc ccaagagaac actgagtatt gattcaaact taggaaaact 

59761 tcatttctgt tatttgcccc taaaagctta aacctctgaa ataaacacaa ctgcagttat 

59821 tttgaaaatg ggtgtaataa tgtcccttta catatttatc ttatttactt cttggtagaa 

59881 aatacattat ttacagatac tgcaaaggta gattcttctg ttaggatttg aaaaggaagc 

59941 tcatttgtta tcaggattct ttggagatgg tagatgcttg gaactagctg attgaactca 

60001 gttttgcatc tgacattctt gttcttttgt tgtactggca gatactgccc tggtggacct 

60061 gattctgatt ttgagtattc cacccagtct tacactggct atgaggtaca gtaactttga 

60121 ggctgtcctg atgaaatgtt gcatcatgct ttacctgtag tatggtttta ccagtactgg 

60181 ctttctgaca attttttgtt tttgtttttg tttttttctg attttaaaag ctgttcattc 

60241 accaaatatt tgccagtgtc tactagtgcc ctatattatt ctagccacta gagaaatact 

60301 tacataagca taaataatac ttacttctta tcccaaatgc tgttctaagt gctttacaac 

60361 tgtaaacctt attgcagtgt tttgaagata gacactatca tgatcccagt ttgccaatca 

60421 ggaaactagg gcatagcagg cttaaacagc tggcccaaag acacactatt agtaagtgac 

60481 aaccaggttt cgaagtctgt gctgcttacc actacattgt actgtcactt tagcagtgga 

60541 aaatggacag ggtcccattg tcttctggaa tttacatttt tttttctttt tttttttttt 

60601 ttttgagatg gagtctcact ctgttaccca ggctagagtg cagtggcacg atctccttca 

60661 ctgcaacctc cacctcctgg gttcaagtga ttctcctgcc tcagcctccc gagtagctgg 

60721 gactacaggc aagcaccacc acgcccagct aatttttgta tttttagtag agacaaggtt 

60781 tcatcatgtt ggccagcctg gtctcgaact gctgacctta agtgatccgc ctgccttggc 

60841 ttcccaaaat gYtgggatta caggcgttag ccactgtacc cagcctggaa tttatatttt 

60901 aataatggaa ggtagacagt aaagaaacaa gaaaaagtat caggcactca aaaaatgcta 

60961 agcagagatg taaaatcaag taaggtgatg gcagctgaga tggtttagct gtggtcgtta 

61021 ggaagggatt ctgtgaagtg agattgaagt tgagtctgaa tgacaagaag gacctagtcc 

61081 taagaatatg tgaaaggggc attcctagca gagaagtcac tagaatgagg cccaaggaag 

61141 gaaagacatg ggggtgttgt gcagtgcagg gggtgaggga agcatgaggc tgggagtgca 
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61201 aatgagctgg ggtttgtaag caaaggtaag gagtttccat tttagcgtag gagtcatgag 

61261 aagctattag gatttaaggc aggggaatga tacaatccaa tttaggtttt ttgaaagatc 

61321 attttgatgg ccatgtagag aaagggttag agtggagacc agaaagaagg cagaaggcca 

61381 gtgaggtgct tttgaaggag tcaccctcct cagaacacct cagaagccag gaaagcttgt 

61441 gacttttttc tcatatctgt tttcattttt tttcttgttt tctagggttc aaatttttta 

61501 aaatacaaga ggaatgattt cgtgaaaagg cttcctctca ttcctatccc ctagccactc 

61561 tttttcactc ttccccctac cagttagtgt tgtttgttat ttcctcccca tccttcaggg 

61621 atattttagg gaacacagca tacacaggtt gcctatccct tatgtgaaat gcctgggacc 

61681 agaagtgttt cggattttgg atttttctgg attttggaat atttgcatat acatagtgag 

61741 aaatcttgag agtggaaccc gaatctgaac atgagattca ttttgggttt atgtacacct 

61801 gatacaccta ggctgaatta attttataca atatttttaa taattttgtg cgtgaaacaa 

61861 agtttgtgtt aagtactatg tgtggaattt tccagttatg gcatcatgtt ggcactcaaa 

61921 aaattatgag gtttggagca tttttgattt ttggattagg gatgctcaac ctggacgtat 

61981 tctttatttg actgattcca tataaggtag catatcagag tcctctttca ctttgccttt 

62041 tattttacaa tatcttagtg aacattttat gtcagtacat tgtttcgtgg ctgtagagta 

62101 ttccactgta tggtggtagc cattacaaac tgtgtggtat tttagaagct tacttaaaag 

62161 ttactttatt atatgattct ggtatatgta tgcacatcta tgtctgtgag cagaacactt 

62221 tggtgaccct gggattccag aagtgtttat acaaaagaca gatgtgatcc aaggagacac 

62281 cctgctgttg aggtgtttat gacagcgtga gtggacacct gccagatgcg attcaggaca 

62341 ttattttgaa ccctgacaag actgagaaaa attaatgcgg gtacaagcca cgttttcagt 

62401 gttcggaacc atggagagtt ttttttaaaa tacagtcctt ttgaaactac tttttagttt 

62461 taattcaRtg tgggcataac aatatttttc tcttctagcc aacctccatg agagctatcc 

62521 gtgccagata tgaccctttc ctacagacaa gacaccgaat agaacaggta catttttaaa 

62581 aaacatgttt cttaaaaatt aggtgtttat acttagtaag aagccattgt tgcttgattc 

62 641 aaattgaacc tgaaataaga atgaaaaagg tgtttttcct ctttgtaagt tttcaatatc 
62701 catttgaggg agggagaatt tgccatgcct agcaaggtca aaaacactac tttctttaaa 
62761 gactgtattt attgtttaag ggttttatat tctctaagtt tttttgaatt tgtagaaagt 
62821 catttgtagt atgaaatttg tggaataaag atgtatgaaa gttcttagac aatgggtggg 
62881 tgtgttgact tttaatttcc aaaagtcaga ttaagaagta ttttgactgg ccatgcgcaa 
.62941 tggctcatgc ctgtaatccc aaccctttgg aaggctgtga cagcaggtca cttgagccta 
63001 agagttcgag accagcctgg gcaacatagc aaaaccccat ctacaaaaaa tacaaaaatt 
63061 agcatggcat tgtggtgtgc acctgaagtc ttagctactt ggaaggctga ggtggaagga 
63121 tcccttgagc ctggaaggtc aaggctgcag tgagctgtga tcataccact gcacttcagc 
63181 ctgggtgaca cagcaagaac ctgtctcaaa aagaagtatt gtgacagatt tgttgggtgg 
63241 aaataggaaa tttcctacaa aggagtacaa agaactagtc ggggtatggc attgttctct 
63301 atcatggLca tggtggtggt tacataactc taaatatcta tcagctctca tccattgtac 
63361 acttaaagtt agtggatttt atcgtattta aattatacct cagtatggtt gactaaaaac 
63421 aagtactatg tacatgacct tgcagtgttc aagaaatctg aacattaata cagatttcct 
63481 ttatttacaa gtttatttta aacttgtcca atttaaaaaa tgtaaagcac tgtccatagt 
63541 tgtaatagta atgtatagta ggcacatcca agtctaaagt agataatggt acataaccat 
63601 agtggataag ttgtctctgg gtttgtttat tggcttattg gtgaatactg ttcagtttta 

63 661 atatccactt tgctgtcacc caagcgtatg aggaacagga ttgtcggtga caggagagga 
63721 ctccatctgg gggagcccac atttttccaa acagtgggtt ctaaactgac ctttgcttca 
63781 atttcttttt gggctatgat agttaattta tttaaaatgt aaaactattg agcatgaaat 
63841 gcttatgttt accaaaaaaa ggagcatagt ttacaagatt tagaaatgaa catagagcag 

63 901 tgattctttt cttaaatgca ctagaattac ctgagtaact tctccaaaac gtgtcacatc 
63961 ttcacctcgg gaggttctga ttaagtctgc tttaaggtcc tagcaagata tttttaaaaa 
64021 ctaccagggt aattttgatg agtatctctt gtaaagaacc atagatacag aaatagagta 
64081 ttcttttagt gttgatatat atgtacacac atgcatatat atagtttttc tgtatacgtt 
64141 ttttgccatt ttcagaaatt agtgttaatt tcaataccta tttttaaaaa ttagaatctt 
64201 ggcttattgt agtcaacaaa atgaaagatt tgtatcattc tctccactag tagaggagac 
64261 ctaattttat tattattatt tttttttttt taaacagagt gtcactcttg ttgcccaggc 

64 321 tggagtgcag tggcacaatc ttgggtcact gcaacctccg cctcctggtt caagcgattc 
64 381 tcttgcctca gcctccagag tagctgggat tacggcatgt ctggctaatt tttatatttt 
64441 tagtagagac ggggtttcac catgttggcc aggctggtct cgaactcctg acctcaggtg 
64 501 atctgccagc ctggtcctcc taagtgctgg gattacaggc atcagccacc gcacctggcc 
64 561 tgaatatttc atttttaatc agactttcac ttttttttag aaagcagact tgaagtgcct 
64 621 cctgtgcctg gaatcatcca tcaattttag actgctgtct tgatttttct ttccaatcta 
64 681 ttcttttttt cttccattac atcaaatcct tattatgtta cataatcatt catgtatcat 
64741 tgttgaccat aaatgccacc tttttgtcct ctaggcttac caccaagtct gacataaaat 
64 801 acatgatcaa taaatactta ctgttttgca aattgtatta tatttgtctt tactgctttc 
64861 tagtttatat tcttcgtgtt tttaaatttc cgctttgtag gtattcaagt caagcctctt 
64 921 atttgttatt ttatattctc attctccctc cttatttgag ttgtactcac ttttttcttt 
64 981 cagacttggg cctatttttt atccacagaa ttagctaagt gtgtttcatt acttctgatt 
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65041 tttaaactgt actgatgaaa acactgcaaa ataagagatt tgcaatgcct tcttagagta 

65101 gttccttatg cttatatcat tctaatgctg atgaatttgt ctttcagtta aaacaacttg 

65161 gtcatagtgt ggataaagtg gagtttattg tgatgggtgg aacgtttatg gcccttccag 

65221 aagaatacag agattatttt attcgaaatt tacatgatgc cttatcagga catacttcca 

65281 acaatattta cgaggcagtc aagtaagaaa ttcttatttt atcatagtct ccagagtggt 

65341 tgtcagttta tgctcctagc agtagtctac gagaatgcct tctgccctgc atccacattc 

65401 ttacttctca taatctttct tgtttcatgg gaaaggatta tttcagtgaa aataatgctt 

654 61 tcgctgaaat aatcttttcc agtgaaaata atcctttcac tgaaataatc cttttaaaga 

65521 aaaaatgaat acagtttgtt gacatagtag tattctcaaa tagagagatt cKccaaatat 

65581 ggtccatgaa tgtcttttca gcccacccaa gaagggggct gggagtattt tagctgctgt 

65641 aaaatcagca aaacaaggta ctgttattta aaaacctaat ttatagtaaa tatttcataa 

65701 ttattaaata gtaactgttt atgattggga tcttagtctg tgttgtgctg ctgtaacaaa 

657 61 atatctgaga ctaggtaatt tataatgcac agaaatgtat tggctcacag ctttggaggt 

65821 tgggaagtcc aatgtcaagg tgctggcatc tggcaagagc cttcttacta cgtcatcaca 

65881 cggcaaaaga caagagaaac aaaaagcgga ccaattcacc cttttataat ggcattaatc 

65941 ttacccacaa ggtcagatcg cctctcagag gtcccacctg ttaatactgt tacaatgaca 

66001 atttcaacat gagttttaga ggggacaaac tcatgtctgt caaatcatgt gttattcaaa 

66061 ccacaagtta atttagtcat tttgaattct agttgacaaa attatgcatc attttgacac 

66121 cttgttttta gcaagaagaa tactacaggt tagtatgtag ttcagtgatt taagaagtga 

66181 aagtcttaaa atagtttttg ttctcaggag ttgcaggaac acctggatag ttactatttt 

66241 ccttatttaa caaatccttc ttgagggccc actagatact atgtgctgtt ttagatactg 

66301 gaaagtggtg atatttaggc tgagaccgga agtacaagga ggagttaggc caacagcaag 

66361 aaagataaaa taaaggctct gaaatgggaa gcagcctggg gtgttctagg gacagagagg 

66421 aagccagcat ggttagatcc tgtgttaatc catttgtgtc cctataaagg aacacctaag 

66481 actaggtaag ttataaagaa aagaggttta attggttcca gttctacagg cttcacatga 

66541 agcatagtgc tggcatttgc ttctggtgag gcctcaggaa gcttccaatc atggtggaag 

66601 gtgaagggga gccagtacat cacatgacgg gtgcagggag gtgccacact cttttaaaca 

66661 acaaatctca cgtgaaacaa ctgagcgaga actcacttat caccaaggag atggtggtaa 

66721 gccatttatt tatgaggaat ccagccccag gacccaaaca cctcccacca ggccacacct 

66781 ccaatattgg ggatcacatt tcaacataag atgtggaggg gacaaacacc caaaccatgt 

66841 cagatcccag tgaacaagag aaagaacgtc atgagacggg gttgaagtgt taggtaaggg 

66901 ccaaqataca tgcttttaag gaqttgaaat tttatttgaa atgcagtagg gagcagatga 

66961 aggggaagtg gcaagttctg gttaacaggt agggtcattc cggctactgt ctggctaatg 

67021 gattagagga qtaccaqggt gaaagtggga aaccagatag gaggccatct gattacatcc 

67081 atccctgctc agatgggggc ggcagcagtg gtgatggaga ggagattgag atgggggtaa 

67141 gagaaaagaa gggatcaagc ctgacactaa ggttttggct gtcagaaatg gtaggaaggt 

67201 agggtagggc tgtatactga ggtgggaaag atagcgggaa gagcagagaa aatctcagag 

67261 ggagaaaatc aagagttctg ttttggatgt gtgaagtctg aagagcctgt gatacattca 

67321 agtggagatg tcggtgggct ctgaatgcaa gagaagtctg agctgacggt acgaaactgg 

67381 ggattatcag ctcataggta acattgacaa ccatgtaagt ggaggagacc acctcatggg 

67441 agagttgtgg atctccaagg tatactaag't gaaaagcaat ttttagagca attcttatag 

67501 tacgatccca ttatttgtgt gttcatgcac atacacacac acatatctgt atataaatgc 

67561 atagaaaagg tggcagaata atggtcatca tagaccttag agctgaggag gaaaggacat 

67621 gggaaatggc agcaaaggag gatatttaca tttgctctgt atacagtggg ccaggtgttg 

67681 tcatgggtgt ttaatattca cttatttagt actcatagtt agcctttgag ttaagtgttc 

67741 agattatctc tgttttatgg gtgaggaagc tgaggcacag agagataagc aatttgccca 

67801 aagttgcaga ggtggttggt ggtagaatgg gatataaatc ccaggtagct ttgctttcag 

67861 agcctaactt tgcaagctgt gctaggtgtc agaatgtgag tgtgtctgta tgtatgtgca 

67921 catgtgtgtg cacatcatca gagcttgaag atcttggaag gaatatggcc tgtttttcct 

67981 tgccctcctt ccctaccacc ctcaggcttt tctctggctt ctcttttata tggggtgagg 

68041 gtttcatata gctaattata aggttgttca aatagtgcca cctcttaaga ttttttgtgt 

68101 aggacaaaat tttggataga cctaagagtg gtttttatta ccctgtaagt aaagcagttc 

68161 ttggcacata gtaagcacaa gtaaatgcgt gaatgaattt tgaatgaaca gttagctaat 

68221 gacctgggta gggttgcctc ttggaattgg gggcagccac atctttttgt gccctcgcta 

68281 ctccccctac ccccttaact tcctttgttc tccttgggtt tgtaaaagtg aaaagaagag 

68341 aggagctttt tcataaaatt taataccaag ggtagctcaa agagcccatc tgaaaggttt 

68401 ggcagctggg agagtttgtg tggacagcag cccacttctg tttgattgac tctagggaat 

68461 gcaacaggtg aattctgtgt ccgtgaatct ggacctgtag cattgtgatt tcttcgtctt 

68521 acaggggctt tagtaataga ggagatggcg actgcattgt tactgctcgt tcaaaactga 

68581 tcaagaggcc gggcgtagtg gctcacgcct gtaatcccag ccctttggga ggccaaggcg 

68641 ggcagatagc ttgaggccag gagtttgaga ccagcctgac caacatggtg aaacttcttc 

68701 tctactaaaa atgtaaaaat tagctgggca tggaggctgg tgcctctagt cccagctact 

687 61 tgggaggccg aggcacagaa acacttgaac ccaggaggca gaggttgcag tgagccaaga 

68821 ttgcactact gcactccagc ctgggcgaca gagtgagact gtgtctcaaa aaacaaacaa 

163 



PATENT 
Docket 524593008700 



68881 


caacaaaaaa 


aactgatcat 


taatatgagt 


catacttagt 


aaatgctgaa' 


gtcttcaaac 


68941 


tttagaggag 


taatgatatc 


atccagctaa 


ttactcttaa 


taatactgaa 


aaatcaaact 


69001 


ataccttaga 


taaaatgtga 


ttgaggaaaa 


acaaccttta 


ttagttcaaa 


gccaggcgac 


69061 


ggggatggca 


gcagaaggtt 


ctctcagagg 


gttgctgacc 


acagttcatt 


cagctctgaa 


69121 


aattccctgg 


cagggacatc 


tatgaagata 


agtttttctc 


tgcaagctta 




69181 


tactcatttc 


ttggacctta 


atatgtaagg 


tcttcttatc 


ttgaagacct 


tacatattaa 


69241 


gtggaattga 


gctgtaaata 


tcttagactt 




cccataaaaa 


tttgccacta 


69301 


agcttttcat 


ctcctacagt 


ttgggtcccc 


tgaggtatat 


gaagcaggcc 


aactaagatc 


69361 


tgcatagtga 


acttttagta 


tgtatctagt 


ttgacatttt 


catcaattga 


aagtaaaaat 


69421 


tttgttttat 


tcttggtgta 


acattttatt 


tttgcagaaa 


tgttctagtg 


ctaatggtgc 


69481 


ttgaatgtaa 


gttttccatc 


attgggttga 


aaatagggtt 


gtctagtcca 


gcgagctcag 


69541 


tgcagatcat 


gatgtgtttg 


tagaaaaagc 


cctgtggaag 


agaaatcctc 


tttcagtaat 


69601 


attctaggca 


gtgccagtgt 


tgttttgttt 


ctgttcttga 




agagggcaac 


69661 


gaacacttta 


ttttcagata 


aaaatttata 


tatgatttgg 


gtcttcattg 


caacacatct 


69721 


catgaatgcc 


tcttgagaag 


taatgaaagt 


acaatctggg 


agccataaaa 




69781 


attacactga 


attctgccaa 


cacacacttt 


aaatgttttg 


ctcttttctc 


tcagtctcta 


69841 


tatttttatg 


agatcatctg 


gaaaaaaaaa 


agacctgatt 


tgtggcgtgt 


tgttgctttg 


69901 


ttaaggtaaa 


gttttactac 


aaacccctca 


taatagagtt 




ttgagggaaa 


69961 


ctttgtattt 


gaggaaataa 


tagtctagtt 


tgtgctatag 


aactagagac 


agaaagtatt 


70021 


ttcaagtgtt 


ggcataattg 


tgaaataaaa 


agcagcccag 


agaagttgtg 


gttttgacat 


70081 


aatgtggccc 


tcggaaatgt 


ttggatttga 


ccttgccctt 




ctgcccagag 


70141 


tctatgagtg 


aaaactggtt 


ggtttgcaca 


gcgtagccca 


ctgctcttag 


atgtaaggtg 


70201 


atgaacttca 


tgtttatttt 


acttttgttt 


ttgcttgctg 


actacataga 


tgtaaactga 


70261 


ctttcattag 


cttagcaggg 


ttttttaaag 


attaatttta 


aattaggtta 


aaaatgatgt 


70321 


attgtgacct 


atgagtcagc 


aagcagcatt 


taaggttaat 


agtctgttca 


cgttagggtc 


70381 


aagttttact 


gctgtgttgg 


ctcaggtgtc 


cctgctatgt 


tttca'tatgt 


tgaacctgat 


70441 


taaagttttg 


cttcttaaaa 


gaataggagt 


taaggtaaag 


aaaagcccca 


gcaagcagag 


70501 


cctggttatt 


atttatggca 


agctagtagc 


aagcagtgtg 




tctcgtggat 


70561 


ggataaattg 


gaaaqttgag 


tgaacagaga 


gttcaaggac 


aaaacaggta 


tggcttttgt 


70621 


gaaggctcat 


taaatcaagc 






ctatcagctg 


gactgagatt 


70681 


cttcagtagt 




caacaattac 


tggtgactgt 


catcgtgtaa 


caatcaggct 


70741 


ctggagatga 


aaaagaccgg 


tagtgggatc 






atgcaaatgt 


70801 


tgccaaataa 


ctccaacaac 


cttttaaaat 


aqttttattc 


ctttttaatc 


agctttgccc 


70861 


agaagcagtt 




tctttaatgc 


tccttggctg 




atgcaattta 


70921 


aagggtaqtt 


acccattaaa 


aagtgagtga 




ctccctgtgg 


aattttaaat 


70981 


tcatttcccg 


ttccttcctc 




ccgcccccca 


cgccccatta 


atgactttag 


71041 


atcctccaac 


tatqttctta 


cctgtctgag 




gtgctaggta 


atgctaggta 


71101 


ccaggcccag 




cgtagacttg 


cacagctgca 


acggaagcaa 


aaggaaccta 


71161 


cagagacctg 


agagtgagtg 


actgtggccc 


tgctgccctg 


ggcgtcattt 


ctggcaggcc 


71221 


tcaggacctt 


ctgcatttct 


gggctttgac 


gctgacactg 


cttatctctc 


actttttcta 


71281 


ttgaccattt 


tactttctct 


tttggtcacc 


cagatttcca 


tacatggtga 


ccaggatcct 


71341 


taacattggc 


cagagaacat 


aggatacaat 


cttagtcact 


ttaagagagt 


tgatatggtt 


71401 


tttctttcag 


cattttattt 


gaaacaaaaa 


ttaaacagtt 


tttagtgagc 


atccacatac 


71461 




gattctacga 


tacttgcttt 


atcacatatc 


tgtcagttcc 


actatccatt 


71521 


catcagtgtc 


tctcgcgtgt 


gcttgcgctg 


tcttttttga 


tgaatttcat 


agtaagttgt 


71581 


atgcttcagt 


acacttctcc 


cgggatactt 


catcatgcat 






71641 


gtctgcagtg 


tttttctttt 


gaagtaaatt 




acaaaacaac 


ttgtggcgtt 


71701 


ttatatatat 


atatatatat 


atatatatat 


atatatatat 


atatatatat 




71761 


acacatatat 


taagtatacc 


atttgattat 


ttttgacaaa 


tgcatatacc 


tgtgctacaa 


71821 


agtcctatta 


agatacagaa 


tgtcaccgtc 


atcccagaaa 


gttcccacat 


cccacttgcc 


71881 


agtaaatcct 


cccctgcgcc 


tcccagaggc 


agccgttctt 


ctgatttttt 


tccccatcac 


71941 


aaattagttt 


tgtctcttct 


agaacttcat 


ataaatggaa 


ccatatagca 


tacacttgta 


72001 


ggcctctctc 


actgagcata 


gtattttgag 


atttatccat 


gtgttgggtg 


attcattagt 


72061 


tgttacctat 


ttagtgctga 


gtagtattcc 


attgtatgca 


gagatcacag 


tttgtttacc 


72121 


atccttctat 


tgatagacgc 


ctgagctgtt 


ttgtttgtgg 


ccattatgaa 


taaaacttca 


72181 


gcgtacgttc 


ttgtgtaagt 


ctttttgtgg 


ctatatgtat 


ttatttctct 


tgggggaata 


72241 


aatagacata 


gaattgctat 


gtaagtttag 


ttttacaaga 


aaccgccagt 


cattttccca 


72301 




actatttgta 


ctcccaccaa 


taatgtatga 


acatttggtt 


gtaccacatc 


72361 


ttcaccaaca 


tatggtgtag 


tcactctttt 


taattttagc 


cattctagtg 


ggcgtataat 


72421 


ggtatctcgt 


ggttttagtt 


tgcttttgcc 


tgatgactaa 


tgatgttgaa 


cactttttta 


72481 


gtatgtgctt 


atgctatttg 


agtatatttc 


ctttgtgaag 


tatctattaa 


aatcttttgc 


72541 


ccatttttga 


ttaggtggtt 


gtatatccta 


gctgccagtc 


ctttgtcagc 


tctatatttt 


72601 


gcaaacatga 


aaacccagtc 


tgtagtttgg 


ctgtttgtta 


tgttaatgat 


atcgtttagc 


72661 


caaagttttt 


aattttgata 


aagtagaatt 


tagcagttgt 


tttctttcat 


ggttattgct 
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72721 tttctgtatt gtctctaata aaccattgca cgttcccaag gcacaaagat attctcctgt 

72781 gttttcttct aattacaggt ttgagctttc acttacaggt ttatgttcca tcttgaatta 

72841 attcttatgt gtaatatgag gtggggatca aggttccttt ttccccatat agacagctag 

72901 ttgctttaac atcacttctt taaagatttt ccttccctat tcggattata tcacaccttt 

72961 gttaaaaatc gaaggactca gtaaatgtgg gctgggctct tttctgtttc atcgatctgt 

73021 ttttcaatcc ttatgccagt gctacactgt tttgattact gtggcttttt agtgtatctt 

73081 gaagtcaagt aatatgagtc ttctaacttt gtaattgttt ttcaaaattg ctttagaaat 

73141 tctaggtcct ttgcatttct atgtaaaatt tagaatcagc tggccaatgc tctattaaaa 

73201 agtataatgg atttagaatt gtgttaaaac tatagaacaa atggaaagaa ttgacaattt 

73261 attgcttctt gcaattcatg aacatagtct atctccttgt atacttaggt ttttaaattc 

73321 tcttagcaat cttcattgtt gagattgtat aagccttttg taaacaaatt ctttcaaaat 

73381 atttgtatgt gttttggtgc tacagtaaat gaaatgtaaa tttcattttt aaattttatt 

73441 attattatta ttattttttt ttttttgaat cggagtcttg ctctgtcgcc caggctggaa 

73501 tgcagtgacg tgatctcagc tcactgcaac ctgtaccttc tgggctcaag acattctcct 

73561 gcctcagcct cccgagtagc tgggattaca ggcatccacc accatgcctg gctaattttt 

73621 gtatttttag tagagatggg gtttcaccat gttggccagg ctggtctcaa actcccggcc 

73681 tcaggtgatc cgcccacttc ggcctcccaa agtgctggga ttacaggtgt gagccaccac 

73741 gcccggccat aaatttcatt ttttcaaatt tttgctgcta atatatatac atacggttga 

73801 tttttatata ttaatgttat gtcataagac cttactaaat tcactactta attctaaaag 

73861 ctatttttgt aaatccttta atatttactt cctaaacaat catgtcatct gcaagtacag 

73921 tgcattttac ttttcccttt tggatttgta tgcttttctt tctcttgcct tactgcactg 

73981 cctaggacct tttcttacag tgttaaacag aagtggtaag agtgggcgtc tctgtcttgt 

74041 tcccagtgat acagggaaaa catttttatt tcagtattaa gtccagtgtt gcctgtgggt. 

74101 tttttatagt tacatgtatt agattgaata agtttattga aagggtttat cattaactca 

74161 tttgtctgat gctttctctg catctattta aatggtcata tgattttcct cctttatttg 

74221 gtaatatgga tcattttgat ttttttttaa cattaaacct cacatgccta ggataaaccc 

74281 tattatatca tcatctttac atattgttgg attcaacttg ctaatacttt gtagaggatt 

74341 tttgtgtctg tgttcataca gggtggtggt ctgtaatttt cttttttata attttgttgt 

74401 caggtattcg ttgttggtat tagtgtaacg caggcttcac aaaacaagta aggatgtgtt 

74461 gttccctccc ctgttttctg aaagtgttca tgtaacatga atatgatttc ttccataaac 

74521 gtttgctaga actcaccagt gaaactatct agggctggaa ttttctttat gggagggttt 

74581 tagatcataa ttcagttcat ttaatagata tagagctatt catattttct gtttcatctg 

74 641 tgtccatttt aaaaagttac gtttttcaag gaatttgtct gtttcattca ttttgtcaaa 

74701 cattttggta ttatgttgcc ttattaggct tttaacatct gtggaatctt agtgatcacc 

74761 cctgtttcaa ccctgatact catcatctgt gttttctctt tttttcttgt ttacccgagt 

74821 taggggttta tcaattttgt tgttcttttc aaagaagtag cttttggttt tatttcctct 

74881 actctgtaga cttctgcttt tatttttatt ctactttctt tccgtttaat tgctcttctt 

74 941 tttctagtga tttaataagg tataaaagct tggccaggcg caatggctca cgcctgtaat 

75001 cccagcactt tgggaggtga ggtgggcgga tcacctgagg tcaggagttc gagaccagcc 

75061 tggccaacat ggcaaaaccc cgtctctact aaaaatacaa aagttagcca tgtgtggtgg 

75121 cacgcacctg taatcccagc tactggggaa gctgaggcag gagaatcgct tgaacctggg 

75181 aggcagaggt tgcagtgagc caagatcacg ccactgccct ccaggctgga taacagagtg 

75241 agactccttc tcaaaaaaag agaaaaaaag cttggccatc attttagaca ttttcctcaa 

75301 agcactgctt tagctgaatc ccacacattt tgatatggtg tattttaatt attattcaat 

75361 tcaaaatatt ttttcatacc ctttatatat atgtatttga tccatggaat gtataggaat 

75421 ggggtgttta atttccaaat ttccagacaa tgaggttttt cttgatatct tattaatttc 

75481 taatttattt tcattttggc cagagaacct actctgtata attttggtgt tttaaaattt 

75541 attgagactt gttttgtggc ccagcatatg tggtctctct tggtgaacat gccatgtgtg 

75601 tttgtaaaga atgtgtgttc tcagttgctg ggtgtcatgt tctataaata tcagttaaac 

75661 caagatggtt ggtagtagtg ttcaggtaaa ttttgttttt tattcttttg tagttctatc 

75721 aattgctaag agattgaaat ctccaagtat gattgaggaa ctctgtacat ctctcttcat 

75781 ttatattgat ttttactgaa tgtattttgt aaatctgtta ttaggtacat acacatctat 

75841 gattgctgtg ttttcctgat gtatgagctt ttcaccattg tgaaattacc tctttatcat 

75901 catgagatgt ccttccgcat ccctggtctt gcagtttact tggtgttaat ttagccgtca 

75961 tgtgcttact gtttgccctg tgtattatcc ttttccatac atttgctttc cacccatgtt 

76021 tctttatctt gaaaatgcat ttctttagac agaagtctac agtaattggc tctttttttt 

76081 tttatccatt ttgctcgtgt gtgcctttta attggagtgt ttagtctgtt aacatttgat 

76141 gtaattattg atacatcagt ttaagctgat ggtttgattt atatctgcca gtttaatcat 

76201 ctcctcactt tggttttcag tagccaagaa taatagttgt aatgaatact attatggtct 

76261 aattctttat aatgtatttt ttctatatcc tttaataggg aatatcttct taagagaaag 

76321 gtagaggact ccttatatct agtacaatgc cttaagcata gaattctggt acttaataaa 

76381 tgctaagtga atgcggtgga aagaactgtc ccttaagaat caggagacat agtctccaga 

76441 cttatttata ttattatttt gcattattac ctggtttaat ccacttattt tgtgtgtgtg 

76501 atttagtttc ctcatctata aaaatgagga ggtttaggac tatattatct ctaaaattat 
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76561 acctttctgt catctatgat ttgaagcttt ctaatgaaga gaatttttat ctaaaagata 

76621 tgttcacaag ttattcttca tttagcaacc actttctgac aatcattttc tataatgttt 

76681 ttatgtatat aacccttaaa tttcaatgtg gatataatat taaaagaatg caatatctgt 

76741 gattcttttt ttcttttcta ttaataatgt gttgctgctg gttctttagc tacacgaggg 

7 6801 gactacaaaa tgatagtttt tgtcagccat gaaaaaaatc aaacctcaaa caaaatgtta 

76861 taagctgttt ttatatatct taagccctgc cagctaatgt gatagggcac aagggtcttt 

76921 gattagtctg taagctgcag tgccaccagg ttggtttttc ataggtagta ctcattttta 

76981 aatcaaaatt ctgtcgttac ttcattttgt gttgggcgtg gttaatttat agaacctcat 

77041 gatataacca ccaatatcga ggaagcagac ttgactacca tctcaaaaaa agattggggg 

77101 taatagttat ttttaaatcc tcaagcacat caaacatcca actcagttaa gtctagagca 

77161 tcactagcag agcattgggc agaacttcaa attttattga ggtattttca agatagggta 

77221 atagttaaga agaggaattt gttcatggtg ctgctgccaa ggagttacac caataactca 

77281 aggtgttatc aacaattcaa acaaatctgg ctgttcaaag aagtgagtaa actccactga 

77341 cttgttttgg gaccgtagat ggagaagaga ttgtatgttg tgactaacga aaatgaatat 

77401 ttcatctctg taccattttt tactgtgcaa tttggtttga acaggttgag tatgaggttg 

77461 cagcatgtcc acacagggaa tgttctgtaa gccattcagc aacttgagcc ttgagttctt 

77521 gtgagagttt aagcaggact agactcaggc acatccagtt gcagtaagga cagaggtttt 

77581 gaaagaggag ctgccacgaa ttatttgtaa tgagaggtgc ccacctcttt gataccacag 

77641 cttcttggag tgaaacaaaa gatagtttcc agaagataat aaagagactt taaaatcagt 

77701 gtgctatctt ccttccatca gtgtccccct ggttgggccc ttagtgcaag gagacagtaa 

77761 taatagatag tgcttctgtc aaaaggtcgc tttcttcttt ccagaaatag aaacatgctt 

77821 cccaagaaat aatctaaatc tatttatatt tctgccactt ctagcttttt gttctgtagt 

77881 catttctctt tttttttttt tttttttttt tctgtatgtc ttatcccccc agctagattg 

77 941 ttaagcatgt ctgggacagg aactattata tttttacttt ctgaagaata cctggcctac 

78001 tgctaggcac cttctaaggt taactttttg caagaggaaa gataagcggt agatgtatct 

78061 ggaggctgct gttggagtga agagagacct cctatgttcc cagttatgcc taattcattt 

78121 attccttctt caaatttgta attttttttt aatcaacgga gaatttttag tgttgaagcc 

78181 ttttagcctc tagacatgtt tgagcctgct ggctttcagg ggcttcctgt ccagatggag 

78241 agacagacat aaccacaaat aacaaggctg actacttgta gtgtgggcag ggtgcttggg 

78301 aatgggagca ctgagatctc actgagggtg gaggggttct gggaagatgt cattagggga 

78361 gaaatgtatt ggagctctgt tttgaaggct ggtagccttt ggtttttgtt. tgcttttaaa 

78421 ttcctgggtt attgtcccat ttctcttcac ccctgctcca caatttttaa aaattctgtc 

78481 aacgtaaggt tttgacttaa gctatgcttc acagagaaca tagcatcttt tatagctggt 

78541 gccactccta tggcatacag aaaagcgtag gattcaacat aaccccacag tgttgagctg 

78601 tacaggctta aaatgaacct gtaataccac aaaaagagca gtggaattgg aatcagagga 

78661 tcccttttga gtcttggcct tgtctctcga gaagttgaca gagttgctgc atgccgaaga 

78721 acttgtagcc cccaaagagg tatgggagtt gaactaggta ctgaaggaga gccatgcttt 

78781 tggatggctg tctaagtggc acatactgtt ggatagctac ctaaatgata cagggggatc 

78841 atattaatac caggccataa agtgtcagca cagattggtt gaaagcctgt atgcgcatat 

78901 ttgcatatta aagaacagtt atgttgatat atttacatct ctgttaaatg aagaatcaga 

78961 tttggtgaga tgtaggatta gatatagtat caaaaaattt tcatgagaat acagtaagcc 

79021 tatgggaaaa attcattgtt ttgtcattca aatttgatac aaatttctgt ttaattgctt 

79081 ttcagatgMa aatacaattc tggaaagagg aaatatagtg aaagcctttt tataaatata 

79141 aaatattttt tcaaaacttg agttgttttt ttcttccacc atttctgttg ctgcaaaagg 

79201 taatgacatt tcccgcctga ggaaaaacat ttttgaaatg gagttaaatt attatttgag 

79261 aataaggttt ccttctgtgg cctgtactat attctgatga cttataaaga gaccttgtgc 

79321 agcagtgtcc tctgtatgtc cttggtgggg cctttgacac tctgcaatag gaagactagc 

79381 taataaattt tctctttcct ctacacaaat tcccatttgt gcaattgcct cccctctagt 

79441 ttttcccagt gatgtaagct actattatgc caccacgctt gctaacttaa tggtatcact 

79501 tttaagaaat actgcaggtt ttttaaaaat accagttgga Rtgcctcatt ctaggatatt 

79561 ctcatttact tttaaaaacg tcatttagag ctttttgctt tctggggttt tttggtactt 

79621 ttttttcaac ctttgtatgg tgtgctttct ccataatata tgaatattta tttttatttg 

7 9681 aaaaatgttt tccctcaaac ccaataattg atgctggagg aaggtgtgtt acgtctctcc 

7 9741 tgtggcatca tgtactgtta ctgcgtgcct tagtacccac ctgtttaagR ggcacaggac 

7 9801 ctgatgattc tattgtttca gagaagccaa attagtttgg atcctgtctt aggcaagatt 

79861 tgataagatc tgagcccttt ttctgtctca tagttcatct tttagtgact ttgaactagt 

79921 tgtttacctt tagtcctgtt gcctgtggta atcctaccta aaacctcgcg gaatacagaa 

79981 ataaacaatg caaaagagtt caaatgtcca gaatgaaaga tttgagaatt tgttctctag 

80041 attggtgatt cttagctttc ttaattatta ttggcaataa ataagtccgt ttttattgac 

80101 aattaagagt gaaagagtga acctcatgga attagttaca aaaattacag aaattttatg 

80161 atttttactt ttcagtaaag aacaatgaac ttcttcagaa agaaggaaaa taaatgggaa 

80221 atttttttta aaggcttctc tcatcctccc cccaaaattg agaacattgt agaagggtgc 

80281 actaagaaga atgatggggt tctaagaatt gagagatgtt ggccgggcgc agtggctcat 

80341 gcccgtaatc ccagcacttt ggggggccga agcaggtgga tcatttgagg tcaggagttc 
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80401 gagaccagcc tggccaacaa agtgaaaccc cgtctctact aaaaatacaa aaaaattagc 

804 61 tgggtgtggt ggcacgcacc tgtaatccta gctactcggg aggctgaagc acaagaaatg 

80521 cttgaaccca ggaggtagag gttgcagtga gccaagatcg cgccactgta ctacagcctg 

80581 ggcaacacag tgaaactccg aatcaaaaaa aaaattgaga gatgttgaaa agcagagaag 

80641 tctggggtgg gtccacgctg agttgtctaa gcagtgtggt aactggaata cagagcaagg 

80701 actttgaagt caggtggacc caacttcaga tcctgactcc gattttatta gctctgtgcc 

807 61 cttgaacatg ctgtttactc cctttcagct tcagtttcat cacttgtgaa atggaggtaa 

80821 taggacacac ttcatagagc tagtgggagg attcagtact agagtgtgtg gcacagagct 

80881 tggcccacaa cagcctctgt aaatgtgagc tcctgtcgcc tcacttcact agccctgcta 

80941 gcaagagact tacttctctg cctctttggc cagcattaag gccactaact gagacagcat 

81001 gaatctaaag tggtgctcgc tactctgata catgatttta gtaaaagaca aaaatgattg 

81061 gtaattttgt aaatcattgc taagattaat agactaggtg aaaaggcttt tgtgtatata 

81121 gaagacattt catctaaaat attcctataa tcattacata tgccctcccc cctttttttt 

81181 ctttcattta aaatatagag tgggaaataa ataatgtaga tggttttcac ttatgtgaaa 

81241 aatggttgct aggaattgaa aaataggttt tcacagttga aatcactgct ttcaaggaga 

81301 ttatggttgc agcttactgc aggagaataa taatgataat gatgataata acattactat 

81361 gctagctcag tcacgtcaat cttatgaggt gtagatagtg tattatcctc attttgcagt 

81421 taaaggaacc gaagcacaga ggttaatagc atccccatgt gcatagttag gaagtagcag 

81481 agccaggatt tgaactcagg cagcctggct ccagagccta cactctttac cactatctca 

81541 tactaggagg acagagagaa ggcagttttt tgagacagag taatgtctta aaacctgcaa 

81601 caggttaatt attttagatt tgctctacag gtattctgag agaagcctca caaagtgtat 

81661 tggaattact attgaaacca gaccagatta ctgcatgaag cgacatttaa gtgacatgtt 

81721 gacctatggc tgcacaaggc tggagattgg ggtgcagagt gtttatgaag atgtggctag 

81781 agacaccaac aggtaagatg gtggcaggtg atcttgcaca agtcttcctc caagttcacc 

81841 attttctcta cattcatacc cagcctttct tccttctgac cactcttagg gaaagaagta 

81901 tgggtattcc tccttttcag agttctttct tctgtctgtg ttcttaattc catcccctct 

81961 ttccctcatt ttcagtcttt ctctagtggg tcttttccag cagcctgtaa acacactcat 

82021 ctctctccct cttcttgtcc taagcagcct tgtccatata gagagcaggg gagaggactc 

82081 ggggttagta gtttaaagca ggagagaagg ccaagaacaa agaaaagagt ttgaataaaa 

82141 aggacatcag ggtaatgttt aagagattca tttttgtgga gcaaatacca actagcaaaa 

82201 agtttggtga tgccatggtt gtaaactttg agaaatttgg gagtgcaaga aaagaaggaa 

82261 gtaagagttg ttagagcttt attatgttcc ctctagaagt atatctttta gttgaaaaga 

82321 aaatacccgc gaaaggttaa ctgtataagg aacttcacag ttaattgcta aattagtgga 

82381 gtaggggaga caattgagag tttattaggc agctgagatg attgaccgag aattctaacg 

82441 gtgggctttg gaggaaggat gtgttgagtt ggatgttaaa ggccccttta ttctgtagga 

82501 tttgggtagg tgtgattggt gttgtgccag gagaaggaag caaaggggaa ctagggaaac 

82561 aagcactgat cagcatctta gaaacaacca agaggaatgc caggccgagg aggatggaat 

82621 tttcccttaa ggaatagaga tctgttgaat attttttagc atggggaaga atctcatgga 

82681 tatcatgttt aattaagaaa gagtaaccga ataggatgct cttatgtggg aaaaggcttc 

82741 aaccagggaa actgtggagt gctgtagcct tcaagggctg tgctagagag acagttgtgc 

82801 agatggaggg gaagggggag ctttctgaag aatacgacat ctgagttggg acttcatgga 

82861 agagtctgag ttggtcaggt gcagaagagg aagattgagg cagaaagaaa ataatgaact 

82921 cctagagatg ggagtgccat agcacatttg aggaatttgg agtggcttag tttgattaga 

82981 atgtgaaata ggagagtgat aagaggtgag gctgaaaagc aggcagaggg gctgattatt 

83041 aggggcctga ggagtcagcc cagggaggct ggacatgaat ctaatggagc aagccaccag 

83101 cacgcataaa gctacaatgt gatggagcat atctgcagct tagaaagatc gcttacagtg 

83161 taaggggcag tcagagaaga gcaagttgga agggagatca ggtacgaggc tgttacagaa 

83221 attaagcagg aaatactgat gtcctggaaa gaagtgaatg gtggtgattg gtgactgatt 

83281 ggaagtgagg ataagagaga atgggaagtc aaggaggctt ggagagaatg tgggtttcat 

83341 ttggctacac taagtttcag atacctgtag aatagccaag tgaaggtttg ctttagagca 

83401 gggctaggca atagacagtt cagcagtgat caaaatgttc tgtgcctgca actgagtttt 

83461 ccattttaat tcttctaaat ataaatttta ataaccacat gtgcctagtg gctgtcattt 

83521 taaatagtgc tgctttaaag tattggttat ttgggtttgg aattcatgag agaggcctgg 

83581 ccttgtccta ggaattggga agtcatcagc aaaaagacat tagaaattga agccatggaa 

83641 atggttggga gcattctgag agagtgagat ggagtctgtg gtttctagga tccttctgcc 

83701 gtgaatggga acaccttcgt ggcagaagga tagccacaag acttcagttt agttgcccaa 

83761 aactattgtt aggacttgtt tcctgctgga ttggctctag aatgccaagt taagatgatt 

83821 tatccacgtg accccaaatg actctgcctt ctcactaagt cattctctct ttctactgtt 

83881 atttaaggat cctccaaacc caccagtgaa atctattttg tatcataggt ttcactcttc 

83941 ctccctgtct acaaataatt tcacaagatc tttaaaggaa agaacatagc agcttgctgc 

84001 ttctgcatgg ttttgagctc attttaattt ttaatacagg tatttacatt cataagctta 

84061 ttttgactgg tgatgctaaa ccaaataatt ttaatacaca ttaacttttg ccatcccaat 

84121 gctttacttt caagacttta agtagatgtg taagagaatt tctgagaaat atctcagaaa 

84181 taaagttatt tacttccagg cctcctgaga ggtggggatg taaaaaatga ctttctgaga 
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84241 


tgtcttttcc 


cccagccata 


tgattttggt 


tatggaactt 


ctgttacctg 


tctcagagat 


84301 


tgttttcccc 




tgttaacttt 




tatttgtgta 




84361 


cacaaatagg 


tagtgatttt 


cacaggtagt 


qattttcttt 


ccttaggggc 


cacactgtga 


84421 


aggcagtgtg 


tgagtcattt 




aaqattccqq 






84481 


tgatgcctga 


cctgccaaac 


gtgggactag 


aaaqaqacat 


tqaacaqttc 


acaqtaaqtq 


84541 


tgacttcagc 






ctacatqttt 


cttatcccat 


ctaqtcttat 


84 601 


tgcttgttca 


ctgttgatgt 


tttccagtgt 


taaaaaaatq 


catccttatt 


atagaatatt 


84661 


cigaaacacag 






ataaccccat 


caaacaaaac 


tcttaacatt 


84721 


ttaatgaatt 


ctttccaaga 


tttttctatg 


cataqqtttt 


H- „„„„„„ 

utggggggcg 


33?gg?^^^^ 


84781 


tgcaaatata 






atqacataqq 






84841 


tcacttaaca 


ttttaacaag 


catttatcca 


tqttqtaqcc 


tqqtctctqt 


taacatqtta 


84901 


acatatttaa 


atgtcttttt 


aatttaatat 


qtcatacctt 


qcttaaaccc 


ttqcaaqttt 


84961 




aagatatact 


ggttttttaa 


tattatttaa 


aaaacactqc 


aqqtaaataa 


85021 


ctttatgtgt 








tttatqataa 


attccatqaa 


85081 


gtagaattat 


caggtaaaqg 


taqaaacatt 


ttaaqaatct 


cactqtatta 


ataaqttqct 


85141 


ttgcagagag 




^^3^3^^^3^ 


ttcaqcaqaq 


tttttcttcc 


caaataattc 


85201 


tctcctgttt 


attttctata 


tatqcttata 


qatqcatat 


cacatat ta 


catatatttt 






aqtttqatca 


ataaaaqtqc 


cattttaact 


tqcatttt a 


aatt taata 


85321 


atacattttt 


caaqtttqta 


tttactaatt 


acatttcttt 


catctttttg 


gaattgtgtg 




tttttatcct 


ctqactcatt 


attctattga 


gataagaata 


tttttagtat 


tagtttatat 


85441 


gaatttttta 


ggtaaggaga 




attggttttt 






85501 


ccttaaccta 


ttgttgacct 


tttacttgtg 


cttccattaa 


gtcgatcttt 


tttttccttt 




tttttttttt 








tttctgttct 




85621 


actctgggtt 


gttttattgt 


tgttatttta 


actctcacag 


tttcttgatc 




85681 


ttgttgtggg 


aaatgagatt 




tttgtttttc 




ccgatagggc 








acaatattta 


tcaagttatt 


tcatctataa 


aagtactlilit 


85801 


aatttactct 


atttaaaaaa 


atgcagttta 






taataattct 


85861 


ttcctctcaa 


gaagtttact 


tttattattt 


tttaaactqq 


taaactacac 




85921 


gtttatgaga 


taacattaat 




ccctcttcqt 


aattccccca 


caat^^atat 


85981 


gtaatagact 


qtqaqtctaq 


tatcctcccc 


tcagtaacta 


ataqccaqqt 


actcaaaca 


86041 




aqocacctaa 


aacacacttt 


ttattatcat 


qattattatr 


3^^3^^3^^3 


86101 


tactttaagt 


tctaqqqtat 


qtqtqcacaa 


cgtgcaggtt 


tqttacata 


tatacatqt 






ttqqctqcac 


ccat aactt 




tattt 


ctcctaat c 
c cc aa go 


86221 


tqtccctccq 


cctqcccccc 


^ ttcttta*^ 


caca*^ctcct 


tat^at tt 


ccc g 


86281 


tqtcccaato 


tqaaacacac 






a ^ a'^acca 


gtaggatagc 


86341 


caqaactcat 


aaaqtttaaq 


tqctcct aa 


aaatata ta 


t^tttcatac 




86401 


cataqcaaat 


aqctaaqtct 


caqtacatat 


^^^.^^^^ 


gtcttcaaca 


taaggaatag 


86461 


ttcaqtaata 


acattgcaag 


qacaaccttt 


cttqtaaaca 


gatttatttg 


cttgtttgta 


86521 


gtctttqcaq 


qqaaactcac 


aqqqaatatq 


aqtqqatcat 


atttcttttc 




86581 


ctagtaaaga 


gtaacagaag 


qaaqctaaaa 


tqaaaaatct 


gagttctttc 


taaaa^aoaa 


86641 


ttctctacaa 


attaaattgt 


tttaaaataa 


ataaatttta 




aaaaacaaac 


86701 


agctcagtag 




aaaaaacttq 


aacaaacatt 


tcacaaaaqa 


qaatattcaa 


86761 


atagccaata 


aacctatcaa 


aaaqttctca 


aaqtaattaa 


tcttcaaaaa 


aqcgaaattt 


85821 


^^^^^^^^^^ 


ttqqtctcaq 




caccttgqtc 


ttqtattqt 


qatataattt 


86881 






aqaaatttat 


tacagatttt 


ttttatacat 




86941 


tgtatgcatt 


gcctctgttt 


cctccaaqtq 


ccttttgttt 




gttgqcc 


87001 


ccttaatgta 


cttggctttc 


cccaaqttta 




caaccaqttt 


cttatqcctt 


87061 






ggagtctcgc 


actqtcattc 




qcaatqacac 


87121 


gatcttggct 


cactgcaacc 


tctgcctccc 


qqqttcaaqc 


aattctcctq 


cctcaqcctt 


87181 


ctgagtagct 






ccacacctqq 


ctaatttttt 


gtatttttag 


87241 


tagagacggg 


gtttcaccat 


gttgaccagg 


ctqqtctcqa 


acttctqacc 


tcqtqatccq 


87301 


cccgcctctg 


cctcccaaag 




acaqgcqtqa 


qccactqcqc 


ctqqcctaaa 


87361 


gcgattcctt 


atgcttttaa 


gagtaggggt 


ttgaaaagcc 


ggatggcaag 


gcctgtgtgc 


87421 


ctagcttgca 


ggcgtgcttc 


actgaagaat 


gttcttttag 


caataagcgc 


ttctttcttt 


87481 


atgggattcc 


taaatgtcag 


tatcgtgaac 


tcttacgtag 


agccatttga 


ttcatccaga 


87541 


gatgaactct 


ccaacttcct 


gctttgggct 


gagtggcctg 


agtatatctc 


aggaagttgg 


87601 


ttgctgacat 


ccagagagta 


gggcggagag 


tccaccagtc 


tatgtgctgg 


cttttccctg 


87661 


ttttcagcct 


tggtcttcat 


ctagaccttc 


ctttgtacct 


ggaatctcag 


agcgcagagg 


87721 


ctttccagtt 


tctccaggga 


ctaaatgttc 


tcatctgcct 


gttcgggaaa 


ggctaagggt 


87781 


taacggataa 


ttccctctat 


gtacacactt 


gatccagtac 




atttccatgt 


87841 


ctctcctcca 


tcctgcccag 


ttcttggtac 


ctctgatccg 


tgagcctttc 


tggaactctg 


87901 


cagtgtgatt 


gagcaggcct 


ttcgttactg 


tctccttttg 


ctattttctt 


tacttgcaaa 


87961 


gtcattcaca 


agccttccta 


gtttctgtct 


ttgaagaatt 


tgttgccatc 


ttgggtctca 


88021 


ttcttgcctc 


ctttcttgtt 


ctctttgttc 


ttctgggttt 


atacttttta 


aaaaattctt 
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88081 tcctgttcct tttttttttt tttttcttga gatggagtct cgctctgtca cccaggctgg 

88141 agtgcagtgg cacgatcttg gctcactgca acctccgcct cctgggttca agcaattctc 

88201 gtgcctcagc ctcctcaata gctgggatta cagacgcgca ccaccacgtc cagctaattt 

88261 ttgtattttt agtagagatg gggtttcacc atgttagcca ggctggtctt gaactcctga 

88321 cctcaggtga tccacctgcc ttggcctccc aacgtgctgg gattacaggt gtgagccact 

88381 gcgcctggcc cctgttactt tagtgtggtt ttggagggaa agaaaataaa cactttttta 

88441 aattctctag tgtaactgaa aattgagaac caatcaattt tcattctttc caaaggaatg 

88501 tatcagcctg tatgtctccg aaacttatta tacaagcacc agaaatactt ttctaaatcc 

88561 catgttcaaa tgtagattca gactgggtga acaatggctt ccatacgtgc tctttgtggg 

88621 atggctgcag aatatttatt aaagacctga tgattcttta aaatataaat gttacaggaa 

88681 aatcacacaa ttacagaaga agatctctga aactaaatag tacataaaga atacagcatt 

88741 taagaacgtg acagatgtca atgagagatg gaaaagtcta ttgttatatt tctttgtaaa 

88801 gcaagttaaa caatgaagag ctttggaaga ctaatgtggc aaagataaaa agtatttttc 

88861 cagtttacaa gatagagccc tgccagctga gtagatactc ccaccaaggt gggagtgaga 

88921 ggggactgtg agtccacttt catttctcca aaatatatta tgcaaaacaa gattagagca 

88981 aattatcagt gcaaagtcat ggaaaggaga atgcctggtg tctcattgct acatcccaaa 

89041 ataaagaaag aaagcagcca gctggggtga tgtagtataa gaaatgactt gccaaaagta 

89101 gttgagttag atctattgct cagtgtccag ataacaaatg gacaaggtgt agcccaggat 

89161 aggaatggtg ccagttggtt tagggacaga gtagtcatat ccagggatca gcaaactttt 

89221 cctctaaagg gccctatagt aaacatttta aattttgtaa gctagatgat ctctgtcaca 

89281 acttttgaaa tgtaagcaat taagtatatt ccattattaa agaaaggtat actctagagg 

89341 tgaaggcaga cattgaatag attgtcatgt gatgataaga aagaggtaat actgggttca 

89401 gtgggatttt ttttaagtgg gacagctagt atttgaaagt cagaggggcc tctctgaaga 

894 61 agtggcattc aaactgaaac ctgaagatta gctagataaa gaaaaattga tgaactttcc 

89521 aggcaaagga aattgccttt gcaggagtgg aaaggccaga tggtgagggg tggcatgaga 

89581 tgagattgtg caggagacaa gctggaatgg tgagggccta gtgcagtcca gcacacactt 

89641 gctttgcccc agtgagacta cagaaacaag gagtttctgt tctgtctgta cgccctacct 

89701 ggtcagaagc aaaggctgcc ccagggccta ctgggtgtgc cagagaagct gtcaggggtt 

89761 gagatttcac cctcggtgat ctctgcataa ctaatggaga agtcattttc tgttctctat 

89821 tcacaggagt tttttgagaa ccctgctttt cgtcccgatg ggctgaaact ctatcctacc 

89881 ctggtgattc gtgggaccgg gctttatgag ctttggaaat caggaagata taagagttac 

89941 tctcctagtg acctggttga attggtqgct cggatcctag ccctcgtgcc tccatggact 

90001 cgagtgtacc gagtacagag gtagtgtgtt atcttttatt cctaaaatag ttggtgacta 

90061 gtctgtttac tatttctcat ggaaatagtc tgatttcata ttgagggttt tggatttttc 

90121 ttaatggaaa taagataact ggaatgctat ctgtaaatag ggagggatgg aaatcatagc 

90181 atgtctaagc cactttgcca ataacgtatt tatttatcta cccattcatt catgagcctg 

90241 gagacagagc catgacggtc aataggcatg gtgcttgctt ccgagcagct tatggtctag 

90301 ttcagtgttt cctcttccag gtctgcttcc atctagatgc agtaatgggt atgagcataa 

90361 gaagtgtggc cgtgtgtgca atctctgttc tagagcctct gaaagaaaaa gtagcaacaa 

90421 tcactcttta cagatattaa tgtaaatgtg gaggaaaggt gacatatttc tgatggcttg 

904 81 aagaaaacaa aataatctga actgctttct tcctagaaaa gagaaagtaa gatctcattt 

90541 acaatcagga accttatcta cctatttata cttacataca tacatacata tatactggaa 

90601 acataacatc acagaaaaat tttagaagcc ataaagttac ccgtatacct acaatcctaa 

90661 caaagccaat tcccacacac acattcccac caccctgcca aaaactaccc aggttccaat 

90721 attacaaatg caaggccaga aggagactgc aaatgcatta caatcagctg ctagagcagg 

90781 actccactga gcacagaaac tgtgtgactg atcatgtaaa gcaatgaaca ctgaaaacaa 

90841 gcYgaattaa ctacttaagg agaattatga aggatataaa gtaactgact tgcttgaagg 

90901 tcaaaggaca tttacatgat acttctgctg catactgatt tctcagtttt aaaatcattg 

90961 gccaaactgc aggatctRaa ttgcctatat ggtctctatt tttaaaaata cacctaagaa 

91021 tactaaggag attttaatag aaaaatcaac tSttgaaatt gcttgtgtgg cccttcccct 

91081 tgttgttcaa cccttctgaa gcaattcagt ccaaaggaca ttaggtggtg YRgagcagtg 

91141 taggtatcca catgcaggag gcagggagcc acagggtcca gagcagggtg agaaggtcac 

91201 tcattcacat gcaggaggca gaggcgtggt gcaggctgat gaagtcagaa tgtggcgaaa 

912 61 agggcattca ttcccaccaa gggcagatct ggtacaggat gtcagagctg tacgtcctct 

91321 aggggtgggt tgggggtata tgcagaagag gagacagcaa gaaaagacca gttacttaca 

91381 gggagttgat ctaatcagca gatatattaa ggatactggg tgctaggttt tttttgttat 

914 41 cttagaagtc aattacaaaa gttgaaaaag gagaaaatta gtgtgagcac tgtggtgttt 

91501 tggaacagga gatagtggtg tgaactcatg gtttccaaca tataggtaca tgtagaaata 

91561 agtgtaaatg taatgaataa caacacaggt ggcccttcat atcttcggat tctgcattct 

91621 acaaattcaa ccaactgcag atggaaaata ttcagaaaaa acagtgggtg gttattgcat 

91681 ctgtactgaa catgtataga tttttttttc ttgtcattac tccctaaaca atacagtata 

91741 acaactagtt atagtactta cagtgtatta gatatcaata aataatctag aggccaggcg 

91801 tggtggctca cgcctgcaat cccagcattt tgggaggccg aagtgggcgg attacctgag 

91861 gtcaggagtt cgagaccagc ctggccaaca tggtgaaacc ctgtctctac taaaaataca 
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91921 


aaaaatggcc 


agacgtggtg 


gcaggtgcct 


gtaatcccag 


ctactcagga 


ggctgaggca 


91981 


ggagaatcac 


ttgaacctgg 


gaggcggagg 


ttgcagtgag 


cagagatcat 


gccattgcat 


92041 


tccaacctgg 


gtgacaaaag 


tgaaactcca 


tctcaaaaaa 


aagtaatcta 


gaggtgattt 


92101 




ggaggatgtg 




atgcaaatac 


tacaccattt 


tatatgaggg 


92161 


acttgagcat 


ccatggattt 


tggtgtcttt 


gggagtccta 


gaaccaatcc 


cctttggata 


92221 


ccgagggaca 


actgtacata 


ttttgtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


92281 


gtgttttata 


gcaatcagtg 


tacttagttt 


ccagtgagca 


cagctggttt 


ctagatcttg 


92341 


gtctctaaat 


gctctccatt 


actaaaagga 


acctcttgtg 


ccagaaagta 


aggaagtatt 


92401 


caaagaagaa 


aagggaatct 


ggcttaaagg 


gctcccattg 


gccaaatctg 


ggacaatttg 


92461 


agcatcaaaa 


taaatattga 


tattaacaga 


ttataaccag 


ctgaataaaa 


taggaatcca 


92521 


ctagtttata 


ccaatattcc 


taattaataa 


attgtaagtg 


tgattcagaa 


acataatatt 


92581 


tgcagtcagt 


cttcatcatt 


ctcagattcc 


ttatttgcaa 


attttcctac 


tcactaaaat 


92641 


tcgtttgtga 


ctcaaatcca 


atactggcat 


tttcacagtc 


attcttgggc 


atgcttagca 


92701 


gtgaaaaatt 


tgagttgccc 




ccagctgagg 




ggtgctctgc 


92761 






actgggagcc 




ttgcagtctg 




92821 


tgtttttcaa 


agttttgtgc 


agttttttga 


ggattcgcca 


tttagtatgg 


cccccaagca 


92881 


tagtactgaa 


gtgctggcta 


ttgctcctga 


gtgcaggaag 




gccttctgga 


92941 


gagaatacat 


gttatgtaag 


ctttgttcag 


gcatgagtga 


tgtgccattg 


gccatgagtt 


93001 


cagtgttcat 


gaatcaacaa 


tatatatcag 


ataaggtgtg 




aacacacata 


93061 


aaacaaggtt 


atgtattaaa 


cagtcgacaa 




accagggcct 


cacgggaacc 


93121 






gagcaattac 


tcagtataca 


gtgtaagtgt 


gttggtgaca 


93181 


ctcggtgtag 


aatgttaact 


acttattgcc 


tgctaacaga 


actagaacca 


actaccgact 


93241 


gtacatagtt 


gcagagtatc 


tcccccacca 


tacttagcaa 


ttgcagagtg 


aaagagaata 


93301 


acctaatagt 


gggaaaacct 


ggcagctgat 


caaagtgaac 


attatcagta 


ataggacaaa 


93361 


tcacaattct 


aggctgcctg 


atagagggta 


tgagaacgta 


gcatcacttc 


tctgttactc 


93421 


agccagaact 


tcacgatctg 


aatccaatag 


tcaggagacg 


ttagacaaaa 


caaaattgag 


93481 


aaacattcta 


aaaca'taac't 


ggcccgtaal: 


ctticaaaagt: 


atcaggatga 


agaaaagagt 


93541 


tctaacttga 


aggagactac 


acagacatga 


caattaaata 


caatgttctg 


aactgggtcc 


93601 


tttggctgta 


aaggacttta 


atgggacact 


ggtgaaactc 


aaatggggtc 


taggattaga 


93661 


cactagtgaa 


gcatccatgt 


taacttcctc 


attttgatgg 


ttcttttgtg 


gttatgtaag 


93721 


agaatatcct 


cgtaggaaat 


atttaaagta 


ttctcagggc 


gttgggacat 




93781 


aattaacttt 


caaatagttc 


aggaaaataa 


ttatttgtac 


tgcacttggg 


actttttatg 


93841 


taactttgtg 


attgtttcaa 


aatctaaact 






tgcagtaaat 


93901 




gagacaaaag 




tttgcatgta 


taactgactc 


aagcaaccag 


93961 


aaagcagtaa 


agatgcagct 


tctcagccgc 


agagcatggc 


aacagatagc 




94021 


tagcaatttc 


acatccagaa 


ataccgaaag 


taacaaggca 




aggcaatcag 


94081 


atgaagaatt 


tgcattttct 


cccttcgttt 




tttctttctg 




94141 


tgagtcaaaa 


gcagcagaga 


cattgctata 


ctattaaatg 




tagactttta 


94201 


aataccacta 


tgcgacatta 


tcctggccta 


ccaacaactg 


agaaaataag 


gacttgaaga 


94261 


aaacaagggc 


agcagaaata 


gtctctatag 


gaaaaatgta 


aaggagtgat 


gacaggaaga 


94321 


gtggagaaca 


cttatatgat 


acaaccgaaa 


ggtcatatga 


taagaaaggc 


atgacattat 


94381 


taataagcaa 


catatgaggc 


ctgttaagga 


aacctggctg 


ttctcagttc 


ttctgataYg 


94441 


accccgttac 


acttatccta 


tggagaattg 


gcccatagga 




gctctaattt 


94501 


caggaggccc 


aactgtcaca 


tatgtttctc 




tcacttatgg 


agcttgtgag 


94561 


tgaggagtgg 


tacaggtgga 


tgtaggctta 


agatttcgaa 


gtgacactct 


tacttcacac 


94621 


Wtaggtgggt 


ggattaaaat 


gtaatggaat 


actgtaaaac 


aagaggcagt 


ggtgcaatct 


94681 


ctttcaaaat 


gtggaattag 


taatgtcctc 


gatgagtctg 


aggacagttt 


attttttgaa 


94741 


gatagtgatg 


actcagttcc 


taacacagta 


aggccattgg 


tgaagagagt 


catccagagt 


94801 


ctagctgtga 


tgtcaggttt 


ttatgcttta 


ggattgataa 


tgtagttaac 
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1 


tcaaatcttc 


tcaaattttc 




aaactaaacc 


agttttctct 


gctttatttg 


61 


ttttttagtc 


aactattaat 


ctttttcttt 


cttctagccc 


ccaaacgctt 


ggagtaagta 


121 


acttacacct 




ctttctttgc 




cctctgcagt 


ttggatgttg 


181 


ataaactgtc 


ctagaactac 


tctctcaaag 


acagaggatt 


ttctgtgtcc 


cccaagtctt 


241 


tgYctcttgc 


atttgctatt 


attgcaggac 


aaggacagag 


ctctctctgt 


tgcaactcca 


301 


gtaccaaaca 


ttacctgtga 


gttagaagcg 


gttcagcaaa 


tctttgttga 


atacagactc 


361 


tgtggtcttt 


tcctttcaaa 


aacttcttgt 


ccatccatcc 


ttatgcatgc 


cctgtgctcc 


421 


atcctagtcc 


aagcaccctt 


tgtcacacct 


cctatacctg 


gtaacactat 


tctggcttgc 
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481 


ttcttcaatt 


ccaattcatc 


ctgcagagta 


tcatttcatt 


tacatcttcc 


ttgggactca 


541 


cccatgactt 


caatgactat 


tgctgcaaac 


tcaatcttct 


gaattggtat 


ttttaaaatg 


601 








attaattata 


gatgcacaag 


gtaaaactga 


661 


acaaacagat 


atctagagaa 


agagaagtct 


ttctctcatc 




agttttcttt 


721 




atcactgata 


gcagtttctt 


gtccctcctt 


gcagagactt 


tctgtgctag 


781 


acaattgctt 


cctgattgga 


ctatggctca 


ccatcatgcc 


tggggccctg 


tcttgaatcc 


841 


tagccagttg 


ggctagggtc 


ttgtttacct 


cctgggcatt 


ctcccattgt 


gggctgtgcc 


901 


atacatcccc 


ccaggcaatg 


atttatgatt 


tgcttcaatt 


ggcatagtct 


aagatgatcc 


961 


ttgctcttag 


aagagtatct 


gggactccag 


cggctgtgac 


cagacatgca 


ggtaacactc 


1021 


ccctgtgggt 


attatgaggt 


attccaacac 


tgttcaccct 


cactggggtg 


gacttccttt 


1081 


ctggtcagtc 


ctctatccca 


accacgctaa 


ttctggcttt 




cagcaaccag 


1141 


aaatcgaaga 


aagtcctaac 


tggttttact 


tttgaactgt 


ggtgagttgt 


gctacgtgtt 


1201 


gacctgattt 


gccaacagat 


aaaatgagat 


gaccatgttc 




gcactattgc 


1261 


ttcactgtct 


agcaggacct 


ctctttagat 




caggatttga 




1321 


ggggcgttag 


acagctaatg 




gtgtatgtga 


atagtgatag 


catctgggta 


1381 


tgggaagctt 


aacgccattg 


tagccagagg 








1441 


cactctggga 


gcctctgact 


agaccctaat 


qacaqcctta 


aqttcqqaaa 


atatccctqt 


1501 








tggggaacat 


qaaaattact 


cttta ttt 


1561 


gatgcaaagt 


accttaacta 


caaataggaa 


gacttttttg 


tccttttaaa 


qtccacctaa 


1621 


tgaacaacag 


gcctgaaaag 


gtacttaaaa 


tattgatttg 


gaggtacatt 


ttqatcaqtt 


1681 


gaactgcgat 


ataattggcg 


atcgaaaga't 


gacacatcca 


tqaqagaqac 


taccccaatc 


1741 


cagggtttct 


caaagtgtga 


gccatigga'ta 


acagtcttgc 


aaaatqaqat 


ttattqqtca 


1801 


aatiatzatiagg 


aaaaaactgc 




ctttccttct 


tccatattca 


caatgtacat 


1861 


tagcatatca 


aggattcaga 


gaagtgtgtt 




cttctaaact 


tttaaattaa 


1921 


ccctttgtct 


cccaaactta 


tttggcgaaa 


aaatgtttta 


tacataqtqc 




1981 


cctaaagaac 


taaggttcag 






qctqcttcaq 


tttattcatg 


2041 


caag't'taaat 


gatttacggt 


catgcgatgc 


acaatgacg't 






2101 


atggaatgat 


cgtcccataa 


gattataata 




actgtacttt 


ttctatattt 


2161 


agatatgttt 






accacggtgt 






2221 


agtattgtaa 


catgctgtac 


aggctagctg 


caataggctci 


taccatatat 


aqrrcaqqtq 


2281 


cataatagga 


tataccatct 


aggtttgtgt 




catqatqttc 


acacaacaaq 


2341 


aaagtcatct 


gaaatgtact 




tatccccatt 


gLtaagagag 


acatqactat 


2401 




aaataactgg 


ttcctctaaa 


taattcacat 


attcatcata 


ttacaaacca 


2461 


ataatacaat 




tataaaccaa 


taacataata 




gtgaaacatt 


2521 


taggtgaacc 


agttgcaata 


aagctgtttt 


ttaatattat 


ctgttatcta 


acatttacat 


2581 


ttgctttttt 


tttaaagtca 




aatcgatttt 




tgagatggag 


2641 


tattgctttg 


tcaccaggct 






cggctcactg 




2701 


ctcccgggtt 


caagcgattc 


ccctgcctca 


gcctcccgag 


tagctgggac 


tacaggcaca 


2761 


tgccaccatg 


cctggctaat 


tttttttatt 


ttagtagaga 






2821 


caggatggtc 


tcgatctcct 


gacctitgtiga 


atccgcctgc 


cttggcctcc 


caaaqtgctq 


2881 


ggattacagg 


tgtgagccac 


cgcacccgcc 


ctaaaaatct 


atttctttat 


aagttaaagt 


2941 


cttattagtt 


gatgcctttt 


ggtccttatt 


ttctagtgaa 


ttcacctgga 


agcctcctaa 


3001 


acctctggca 


ggaaccagag 


ggtttgctac 


tgctcactga 


tgttttttcc 




3061 














3121 


caataggttt 


ttggggaaca 


ggtggtgttt 


ggttacctga 


ataagttctt 


taqtqqtqqt 


3181 




ctggtgcacc 


catcacctga 


gcagagcaca 


ctggacctaa 


tqtqcaqtcc 


3241 


tttctctctc 


accacttccc 








tatcattcta 


3301 


ttttttcttt 


ttttagacag 


agtctcactc 


cgtcgcccag 


gctggagtgc 


agtggtgcaa 


3361 


tctcagctca 


ctgcaacctc 


cgtttcccag 


ttcaagtgat 




cagcctcccc 


3421 


agaagctggg 


actccaggcg 


cacgccatca 




atttttgtat 


ttttagtaga 


3481 


gatggggtct 


caccatgttg 


gccagactgg 


tctcaaactc 


ctgacctcaa 


gtgatccgcc 


3541 




tcccaatgtg 


ctgggattat 


gggcgtgaac 




agctgttgta 


3601 


tcattcttat 


ccttttgcat 


cctcatagct 


tagctcccac 


ttataagtga 


gaacatatga 


3661 


ttggttttcc 


atttttgagt 


taccaaaata 


ttcttttcat 


tacaatcagc 


ttctaatcca 


3721 


cagtggtcca 


aaccaagttt 


tcctcaggtt 


ttttcaaaat 


cctattatca 


ctcataagtt 


3781 


ctaggtcaga 


atgttgtctc 


tccgttatac 


aattggttat 


attatctcat 


ataatttctt 


3841 


aaaaagcatt 


gatttaaaaa 


caaaaatcac 


acataaatga 


aaattgacat 


ttcataagac 


3901 


catgtacaca 


tatttatgaa 


gacagaacaa 


ggactaaagt 


ataggttatt 


gatttatcaa 


3961 


gactgctcct 


ctattgtcat 


tcagagaatg 


gtcaagaagc 


cagcttctct 


gagaatccta 


4021 


tcttgaggca 


tagattctgc 


agtttcttta 


ttccgcagga 


actgctgtag 


aaaggagtac 


4081 


ttaaataaag 


tctccctggt 


ctaggagcca 


gaactccacc 


ttaatctggg 


agaaaacagg 


4141 


gaaataaagt 


ggccagaggc 


tagtagctac 


tgctttgttg 


cttcaagaga 


aggaagtatg 


4201 


agcctctatg 


aaggggttga 


ggttttttgc 


tggggacttt 


ggggcacaga 


atgtaaaaaa 


4261 


aaaactgcgt 


ctggacaata 


gtcagctatg 
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4321 cagtggcttg tgcctgtaat cccagcactt tgggaggccg aggctggtgg attacctgag 

4381 gtcaggagtt taagaccagc ctgatcaata tggtgaaacc ccgtctctac taaaatacaa 

4441 aatttagccc agcatggtgg tgtgcaactg tagtcccagc taattgggag gctgagacag 

4501 gagaattgct tgaacctggg aggcggaggt tgcagtgagc cgggattgaa ccactgcact 

4 561 ccagccaggg cgacagagcg agacactgtc ccataaataa ataaataaat aaaggggcca 

4 621 ggcacagtgg ctcatacctg taatcccagc actttgggga ggccaaggtg agtggatcac 

4 681 ctgaggtcag gagtttgaaa ccagcctggc taatgtgttg aacctgcatc tctactaaaa 

4741 ataccaaaaa aaaaaaaaaa aaaaaaaaat tagccaggca tggtggcagg tgcctgtagt 

4 801 cctaactact cgggaggctg aggcaggaga atcccttgaa catgggaggt ggaggttaca 

4861 gtgagactcc atctcaaaaa aaaaaaaaaa aaaaaaaaag aaaagaagat tagttgatgt 

4921 gagcacagct taagaaaatg cttgaaagca ggtgactaga cttatgtatg actaacttcc 

4 981 ttttaagaaa accaagggtc tcagcaagta cagtagttcc cagagttttt gatttcagag 

5041 ataaacaaca tcaaaataat tggtaactga ctttagtaac ctccttttgt tattttttga 

5101 aaaaggatac ttaaaaaaca ctacaattta ttgttaccat tgttttggtt tttttttttc 

5161 atcagaaaaa aaaatagtga aggcacagcc tcacaattat atataatttt tgaaattaaa 

5221 tacattcagg ttcatgaaaa actcaggaca ttgtcctatt tttttttcct cattttatca 

5281 gggtcatttt ttcatcaact agctgattca catagaaaaa tcctgaaaca catattattt 

5341 atcttcagag aaattcatgt tgatgttact agttgcatca aatatctcag ccagacaaca 

5401 ttatttgttg aatcttccag gcagcggatt tttttaaaag ggcctgattc ctgcccttaa 

54 61 tagcaaaagg gtgctgtttt cagaataagt aaacagggta ttaaacaaaa gagagatgtc 

5521 catcttcttg aagaaatagt gaagaagcaa gacagcaagg ggaaatggct tgtcgagttt 

5581 ctttaatatt tagaaataat caagatcatt ttcaagggtc aagtactctc tcccatttta 

5641 agaaaaactc cttcattcag ggaaatcatt tgaaacttta aagatgtatc aacaagtacc 

5701 ccataaattt atacaaataa aaaaggacat atcaacaggt ggcaaacctg ttaaacacat 

57 61 ggtctattat gcaatactta attgattact aatattaatt gtgcactcat tatatacaaa 

5821 gtactcccct cgattctgag gcatggggag agagttgcaa agatgaatag gtccctattt 

5881 catggaattt acagtttagt aatctgtaga tgggtaaaca aaccacaatg agtaggtcaa 

5941 tctaacagag ggactgatca tttgttaaaa agcaaatcgt gggctcatat gcaatccatt 

6001 attttcttta aagaagaatt agtggtcttt taaaaaaata tgtatgcttc ctatgatgca 

6061 tcacaacgac tcttagtctg aactaaattt gcagtcagcc agaagatgtc ccaaatcatg 

6121 tcctgactgc tgatcacaca gactaccacc tggtttcagc tgaagattct tttaattgga 

6181 tttttttctg tgtttctgtt tttcactgga ttttgttttt ctctgttcct ctccctggag 

6241 gaaagtggaa atttggttac ttttttgtga tggaagtata ctttcattta ttattattgt 

6301 cgttattatt atgaattttg aagccggact tttaaaagcc tggaaggctg cggcgaatgg 

6361 catgtgagag gggaggtttg cgggaggata ccagcaggtg gcgcgtgtct cctccgggca 

6421 gattagaatg ctttgctcag ggccaagaat cccgcaggaa ttaagggcaa ctatctgtgc 

6481 agtcaattca agtgacacag atcatgccat acttaagtgc cgagtccatg aagtcaccag 

6541 ttccagcctg Ltgtctaggc catcaaactt atgtattacc acttaaaaaa ccagctcccc 

6601 tactccaagg gttgcattac tttcgtgctt gacaccccga caaccccgca cccacccccc 

6661 gactcatttt ccccaagcca agtcgctcat aggggacaat caagccttaa gaatagagca 

6721 aaaataaagg ccctggaaag ttggacatga agctttaact tcaggaaagg aatggtattc 

6781 tattaatcgt ccagctggtc tgcccttagg gtgtcagatg ttctctgtcg cccctggtgt 

6841 gccagcattg gcctgggtgc cactgggtgg ctgatggcag aatgtgaatt cacttgcaag 

6901 cggacatgtt cgtttccatg ccccaaagcg ggaagaagaa gggcacaagg agacttcgat 

6961 ggttctttca gctcagactc gagctgtgac tgatttgctg tttggggtca ctggctttct 

7021 agcctctcct ggcaccaccc aatttcgggg agcagccagc tcatctttct cctacgacaa 

7081 tgtcttctgt cccccataaa cactgccctc attttgagtt tttctttcct atgtctgtag 

7141 ctttcttctg ccgctcagca aacactctgt cccactcgta ttcgtccctt ggcgctggcg 

7201 tctggcccag gggtacactg tgaaggaggg caaccctaga ttctgagggc cctagctcga 

7261 atctagcgcc cctctaactg caggggtgac cttagataca tttattaact tcctctgtag 

7321 actcagggga acaactttgg cactcctggg gtccctggat taatgaggta acttacgtag 

7381 actccctaaa caagcgtctg gttcattgta agcgcccaaa tgccaggggg catctgcagg 

7441 gatgacccca tagtgaatga aaactgcctc ttggggatct ctgggttaaa cggccagaat 

7501 aaatcgcaga attgagagaa ccctctcctg ccccaaacct cactgtactg taaatcccat 

7561 tcattgcctc ccgccgtttt atccgtgttc tcagaccacc cccctatccc cgcaacctcc 

7621 agtcccctaa agcctcctgg gcggcaaaag gacgcccaga gagaggttcg gcgcctcagt 

7 681 tgccccggag cagacgtcca gccccgccgt gtacccgatt gcggggcgga tcgcggcaag 

7741 gcggcggcgg ccggcagaat aggcgcggag gaaggctcag gcggggcaga ctgcgtgggg 

7801 gaaggaggag gagagagcag acggcggagg agggcagagc agccgggggg agggcgcagg 

7861 ggcgggagga gacacatgcg cgctgccgcc gccgccgccg ccgccgcagt ccttagcttc 

7921 ccggggacag gaaaccttca agaccgagct gccacggccg cctccccgcc cgccccccat 

7981 tctacgcgcc tgcccacacc ctcctcccct ccttccagcg cctttcggtg gagcactgcg 

8041 gcactcagcc cgagctgccg ttttcccctc gcggggaacg ctgtgacccc cccgcaggag 

8101 cggcggggcg gggtgggggg gcccgggaga agatggcgac gccgggaagc gaaccccaac 
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8161 ctttcgtccc ggccctttcg gtagctactc tgcacccact tcatcatccc caccaccacc 

8221 accaccacca tcagcaccac ggaggaaccg gcgcccccgg cggggcgggt ggtggcggcg 

8281 gtggcagcgg gggcttcaac ctgcccttga accggggtct ggagcgcgcg cttgaggagg 

8341 cggccaactc cggggggctg aacctgagcg ccaggaaatt gaaggaattt ccccgtaccg 

84 01 cagcccccgg gcacgacctc tcggacacgg tgcaggcagg tgagtgaggg ccgaggggcg 

84 61 ggcaggggtg tgggtgctgt ctgggtgtct gtcgtgcgtt ccctaacgcg gtggacagtc 

8521 ggagatcttg tcttgctggg ggagggaggg tccatcggcc cgttgtctcc cgaagaaggg 

8581 actcgcgtgg gcgcggaaga agcgggccct ggagagggca cgggggccct gcctggtccg 

8 641 gcgatgcagt gccaggaggg caggggtgcg ccgggcctct gcgcctgaga gcgaggggtc 

8701 tccggctctc caccctgtgg ctgccgcgcc agggaagtaa ccgcggggct gggacttgca 

8761 ggcgcgccgc gtgcgcgcag tgtaagtggg aggtttcagg cggctgtgtc ggcttcctgg 

8821 gccccgcgca gggctgggaa ctccagcgcg gacagcggcg tcggcgcctc caaccagttc 

8881 aagccgtctt tgcatcgagg cgtagcccat cctggtgggg aaacccagac aaaggtggca 

8941 cgccctcggc cgagccaggg gcatcggcgc agtgccgcga gccccctcgg cgacatcgcc 

9001 gagcgatcgg gcaactcggt gccgcctgtg aggtgcccaa gtttcctctc cctgcgcttg 

9061 tgcgggaaaa gagccgccgg gcttgtagtg aatcccagtc ggctgccgct tcggaagcga 

9121 tcggccctct tccttccgtg ccttttgctc accgcccaca cagtttgggt cctaccaccg 

9181 aggctacctg gttctcagtt acccaaccgt tgcgggggcc ccaggaaagg acggcgtggc 

9241 atattaaagg caaagtaatt aacgtgagcc tgttttcctg tgtgagcccg gcaagttgta 

9301 actcgcgcca ataagcagca taactttttc aagttacgtc atgtgttaca tacttctaaa 

9361 acgtctgctt tctctttggt agttggagtt ccaggagcgg gatctataaa caggaaggag 

9421 ggtggtggaa taggattggg gccttagggt ctatacagaa ctgacttggt ttccggccat 

9481 ccggttccgc tccctggggg agggtttgtg tcatagcaag cgcccaacat ttcccagggc 

9541 agtgggtgct ccgttttgga gccgctgccc agacctcttg gctgtcatcg cctcatggcc 

9601 cagggtcaga ccccctgggt cttggatggg ttgactctca tcacaggagc acccagttta 

9661 atcttgtagt gggcaaggga gcaggcagtg ggtttttgga ggctaggttt ccagtgtctt 

9721 ttcccccttt taattcaata aacatttatc tagagcctta atgaatattt agatacagtt 

9781 ctgagttcac tggaggcaaa gatggctctc cagggtcatt aaactttgta atttggattt 

9841 agggatagtc aggaatcggc aaaaaagagg gaatttccaa agttgagctg tcaacagctg 

9901 gggagtgtct ttcaataaga cctcttagtt acatcttctc aagtggaact tattccaaga 

9961 attgcctgaa tcactgactt aaaaaactac attaaatttt acttcccttc tgctatagag 

10021 tgtaccgtga gctagctatg ctatggttgg ctatatctgc agaaaataga gctggcatga 

10081 ctaataataa taggtttgtg tagttcagta agggggcttc atagttgttc tacttcagtt 

10141 atgctaagga ttatttacaa agttaggtga acttgatgtg tctgctgtgt tggcatttct 

10201 ttgagaaRac cagtgctttt agggatcaca tcttttagct aagacctcgc catgtgaagt 

10261 ggatgctgaa agtgttcttg ttggcttgtt tatggcaagc ttaagttgtg agttttgcag 

10321 gtttttattt atgagctcac ttgcacatgt tttatgctca ggaaaatcta actggttaaa 

10381 attcagaggc tttatgttaa ttcctcatcc aaaaatacag tattatttgg gggaagagga 

10441 gtggagggat attcatgttt gtagaaactt aagtcaaccc catcagaaaa gtattcattg 

10501 cagctagtta ttttacatga aattagaata cccaaatgta ttaaatattt gcttctgcaa 

10561 aacgtctttg cctcaggtca aaagctaggt gtaaaccaca tgtactcttt gttctaaaga 

10621 taaatttaat tgagcacagt ttcttatttg agagaccaga aagcaaaata aagagttaaa 

10681 gttacctccg gtgtacacag tgttgacaca gcgtacatgt aagagttcta gtacagttga 

10741 aagtttgagt agtaacattt ataataaaga tagtggtttc tccttgactc atatttactt 

10801 taagaatgtc tcttaaaatt agaatggcat atgaagatcc ttaaaggcct aagaggcaaa 

10861 gtggattttt ttttccatat attttctaag acagggagtg gtagagcaag ctttttctca 

10921 cagttttgtt gtatattaca ggccatttgg tttacgcctt gtaaaaagtc aggcttctgg 

10981 cagccacctg tgcagaactg cacctgggca atatagccca tctgaacaga gtggaaaacc 

11041 aagatgtaaa caaagaaaca tagaatagat gtctaagctg tcactcaagt gcatatactt 

11101 tattgatagg aaaaaattca agctctcatt ttgagcctat taatttactt ctatttgaat 

11161 caaaataagt tttgttgtca gagtctagtc ttttaatagg tgaaacgggg aaagaaagac 

11221 tgatccacgc aggtgcagtg agtgacacag gtgtcactga tggaaaggga caaaacagaa 

11281 gagccactat cagcactggg tgcgcctctc tatgcgttat ctgcatgatc ttagtcccca 

11341 ctaggagccc agtcataccc tcatttatac atgggaagac tgagacagag acaagttaca 

11401 taatatgctg aagattatac ctctactgag atggtcaacc ggtgattgga accctggcag 

11461 cctcacgtta attcatgttc ttgaccatat tctcatcaag cagagggaag gggttgagag 

11521 atagaatctg ggaggcttcc tggaaattca ttcaaccagc attgattcca caagaatgta 

11581 ctgagcccct ctgagtgctg gccactgatc taggcactgg ggacacaaca gtgaccaaaa 

11641 gagaatccct gtatcccttc ttgtggagtt cattctagtt ggagaataga gagatacatg 

11701 aataaattga ttttgcagga taaataggat gagttacagg atagatggaa tatctttgtt 

11761 ttataaatac acagattgct ttgtattgac ttcaggtggc ttgtagcaag agtataaatg 

11821 atgactagta atacatataa tctgaagcat aaaattgggc caaagaaaaa caagtacaca 

11881 aatttgcata tgagtaaagc aaattttttt tttttttggt gctgatttat taagtagtgt 

11941 cgttttgaat gaagaagttc aaaagggtat tttgtacaat taaaagtgaa ttgaaaagga 
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12001 gtaagaaaag aaagtggctg attgcaggag gttctagaaa ataaaaaaga agcccttaga 

12061 cttggcatgc aggacacttg ttgaaagtct tgttaaggaa agtggggaat atttcattgc 

12121 taattatctt tttgagttgg ctctaagcag gaaagactat gggccaggca ctgtggctca 

12181 cacctgtaat cccagcactt tgggaggcct aggtggaagg accacttgac cccaggagtt 

12241 tgagaacggt ctgggcaata tagtgagacc ccatctctac gaaaaataaa atattaggcc 

12301 aggcacgata gctcactcct gtaacgcctg cactttggga ggccaaggtg ggtggatcac 

12361 ctgaggtcag gagtttgaga ccagcctgac caacatggta aaacctcatc tctactgaaa 

12421 atacaaacat tagccgggca tggtggcatg tgcctgtaat ctcagctact caggaggctg 

124 81 aggtgggaga atcacttgaa cccaggaggc ggaggttgca gtgagctgag gttgcagtga 

12541 gctgaggtcg caccactgca tgccagcctg ggcgacagag agagactcca tctcaaaaaa 

12601 tgaatgaata aataagtaaa taaaataaaa aattaggcgg gtgtggtggc gcatgcctgt 

12661 agttccagct atgcggacgg ctgaggtggg aagcccatag gttgaggctg cagtgaaagc 

12721 catgatcgcg ccactgcact ccagcctacg tgacagagtg agaccctgcc ttaaacaaac 

12781 aaacaaacaa acaaacaaaa caaagactat gtgggaaaca aagagcttac tcagggaaga 

12841 atattgaagt gcagcattaa aaatacccat ttatgtgtga gggaagagtt ctgtaggaga 

12901 tgcaaagtat aagacaatac aggaatacat gagatgctaa ataacagttc caggaagcca 

12961 ctgaggccca aggcaggtca aggtgaggtg ccaagtagca ctgatggaca ttgtgtgtag 

13021 aattcagagg aaaaggtcat tttcatgagg atgcttagga aagactttga tagcagaggt 

13081 aggatttgga tcaggttttg taggatggat aggatttaga tagaagtgcc tatgaatatt 

13141 ctgtgattgg aaggatggtg ccaagacgtg gagtaagaac ctgacagttg gccgggtgtg 

13201 gtgactcaca cctgtaatcc cagcattttg ggaggctgag gcgggtggat cacttgaggc 

13261 caggagttca agaccagcct ggccaagacg gtgaaaccct gtgtctacaa aaaaaaaaaa 

13321 aaaaaaaaaa aaaagctggg ttcctgtaat cccagttact cgggaggctg aggcatgaga 

13381 attgcttgaa cctgggaggt gaaggttgta gtgagccgag gttgcgccac tgcactccag 

134 41 cctgggcgac agaacaagac tctgtctcaa acaaacaaac aaacaaatga acctgacagt 

13501 ctaagattct gtgaataaaa gaaacagaga gggcatttaa gcctgtcttt ctggtcatat 

13561 tcttctaccc aactcctgcc ttataatgga gtaaaggcta cagggctctt atgacccttt 

13621 aattgaggac tccggaaatg gctgaagcct aggatttgtc agcctgtggg aataacaccc 

13681 tagctctgtg tgatctggct ctctcaaatg tctgtacaga ttcaaaggat cttctctgag 

13741 atttgtgggt ggttttatct cttgaaggct gacttccaca ccctattccc atcctaaagc 

13801 aatggagtcc ttgggatgtt agcactcttt agtcgttgga attcacagaa gacttgataa 

13861 caaaatctag tcattttcaa caacttaact ttatttcata atttttgcca tcaatatgtt 

13921 tcactataaa tataataaaa tttttgaaat cacatgtgtt tccttaaaat gtatgttctt 

13981 ttatagttat atccagtttt gaattatcgt tctgctattg aattaaattt ggtagttttt 

14041 aacatacctg ttgctttttg tggtggttac tatcagataa atgcctttgt ttattacatg 

14101 tattctaaat attcttccat tcataaacta tgattcatat ccttttcttt cctattacta 

14161 ttgttttttc ttccctgtta aggtatagat aaattccaac atcaaacaaa agcttttgga 

14221 acaactcaga caatatctgt agtggttctg aatcacgtct agatgggccc ttccacatgt 

14281 ctgtggtata tggcaaaaac agtctccaca aacctacatg ctattctcat tgcttgcatt 

14341 acaggctgct ttcttgaaaa ggcaaggaca tactaatcat gcttctagaa taatagcata 

14401 caattatagg ccagggaacc tctgggaatt tttggtctgt gccagtttat aaaggaacag 

14461 cccaaggttt gaccttcatc actgagatga ctctactgga gagtttgtca gtcactgcaa 

14521 catctgattt gatttttggt aggtgtttga tactaggcca caaattagca caccctgtca 

14581 tttaaaaatt cttgtttaaa ttattgactg agactttttt ttgtgtgcgt gtgttactgt 

14641 aatgaactga aatttcccat ggcatggcat aataactctt tgtaataaac atttaaaggg 

14701 tagatacctt ttgaaatcca tgttgacgtc gattaattca gaagtcaaac ttgttttgcc 

14761 cgtgctgtcc atctggtata tgtacataga ctttttctat tatatcattc aacttcatca 

14821 ttgcttgttt aaacacaagc tattttcctc ttttgtgtat acatttctgt tcccccatat 

14881 atatagactt tttatatttc agctgggcag catttacttg ctcttaatgt ctccccataa 

14941 tttcccttca ggtgtggtac tgcatgtgct ttttcaaagc cttgataatt taaatggcag 

15001 aaatgctatt attaactcca tataaggtgg ttatgggttt gtcactcaRg ccatgtgtaa 

15061 cttgaatttt tgtctaagcc tcagaaaaac ctgtttgtga ggccactttt Yagaatttct 

15121 tgggcttttt ggaaaattga acgtctgtct tggatattgg cttgagctta gatatggcaa 

15181 ttccaccagt gattatttca gtgattgtgt gaattgttag aattaagata atttggtgtt 

15241 aattgcagtg attatttcaa gttataagat tRtaatctat tataaaYgtg ttaaggatgg 

15301 aaggcacaaa acccaactat ttttcYRtag tgaataatct tttttaatgc ccaactcata 

15361 ggaagcctag cttctgttat tgctactagt cttttgttgc atgaaagatt tttgtttttg 

15421 catcagaatt ttgccatgat atttcagaga gtaagaaaag aaaaaaagaa aagaaaggtc 

15481 agggttccaa gtcagactta atgtaggaaa actaggcatt ctaatataga ctcaaacatg 

15541 aaatatgcct gtgttattct tcatcctaat ttaagagttt agtgatattg cttccatatt 

15601 ctgcagtaag aaatagtaca tttggaaacc atctcctccc ataaagtaat tcaaaaataa 

15661 agtgagattg tttacatgaa tagaaagttg atgaactgtg agaattgggg cttcccttgc 

15721 ttactccagc catgcaatag aagggaaatg cttttatagt cagatttctg ttcactgttt 

15781 ttcctaacca tatcctcctt cactggccat aaaggaggat atagctggag aaaaaaatca 
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15841 


gagaagaaca 


gtatgtttga 


tttgcataac 


agaaatactt 


catattttgc 


ttggtaagaa 


15901 


attattgctt 


ctctgttaat 


atgatgctgg 


tgctgaggca 


gctcaactga 


ctatacctga 


15961 


aattcttacc 


gaacattttt 


aagtaatcgg 


ggacaggtta 


tttgattttg 


ggatcttcct 


16021 


agatgtttgt 




tctgctgttt 




ctcagtccct 


ctgctggagg 


16081 


gggacctaac 


acaggtgtgg 


gatgaaactg 


agcaggagcc 


ttctagctca 


gtgtgtcttg 


16141 




ccactgttct 


tgccaaatgg 


gcagaaagtg 




tagtttttct 


16201 


aatcttctgg 


cttctcatgt 


ggacttgctt 


ttigacagctg 


acccctgcag 


tgtggtatgt 


16261 


atagctttgg 


cacacgatg't 


gttcctctgt 


tattcttccc 


tatttttgat 




16321 


tacagcg'tga 


ggagtagcat: 


caatatttct 






actctaattt 


16381 


aactacttta 


tctctigcagg 


caatggtggc 


aattcaaaga 


cagcaccatc 


aacacactgc 


16441 


ttggcttgcc 


cttcctctgg 


attgccttca 


ttttgctttt 


ccaagtcatt 


tacctcatgt 


16501 


cccttctcct 


ccctgttctg 


taggattcta 




tctcatttca 




16561 


cagcctgtca 




cctccctatc 


agccctttct 




aqcactqaat 


16621 


catacatgac 


cttaagccat 


ctttgcatta 


tttccttatg 


gcattttttq 


ttttattqcc 


16681 








ttttgctcct 


tcactqqcct 


qtqaqatttq 


16741 


gctagacggg 


acttttcctg 




ggcttccatg 


cccaqqacaq 


aaaatqctaq 


16801 




atgqattact 


ctctttcctc 


tttcctacac 


ttqqtctctq 


catttttcat 


16861 


ccatcatctt 


ggatctaccc 


taaaqttatc 




qcataqtttq 


ct atct ct 


16921 


cagattttcc 


tgtgagagct 


qatttttctc 


acataqcctc 


tttttacttc 


tcaqtqtcat 


16981 


ccactgtatt 




qqqcaccatq 


acaatattcc 


tcaqactqqq 


tqqcttaRac 


17041 


aacagaaatt 


aattttccca 


tagctctgga 


qactaqaaat 


ccaaaatcaa 


aatatcoaca 


17101 


ggtttgattc 


tcctgaggcc 


tctctcttgg 


cttacaaata 


qaqqttqtct 


tcctaaatct 


17161 








cttgttgtcc 


cttccta a 


ccaaatttcc 


17221 


tcttcttata 


agggtatcag 


tgagattgga 


ttgggaccca 


cccataaaac 




17281 


cttcattact 


tctttaaaga 


ctttgtctct 


aaatacagtc 


acattctgag 


atactagaaa 


17341 


ttagtgcttc 


cattactttt 


gggggtcata 


attcagccca 


taacgcccac 


tgactcaccc 


17401 


ctgctgtagc 


tggtagctgg 








ttaaactt 


17461 


gtgaggctgc 




actcttacaa 


gaqgcccaag 


acaqatctqt 


ttqttaacca 


17521 


tcctgtcctt 


ggaatctagc 


cctgcacaca 


ctaactacac 


ctacatccac 


gctqaqaqtc 


17581 


ccaataatgc 


tcctgagccc 


tcgaatgtgc 


tgtaaaatgq 


taaqaqaqcc 


acatatacac 


17641 


ttacaaatgg 


atgactccgg 


ctcatctgga 


cccctcctcc 


atctctcata 


qacttaqaat 


17701 


cttttcggtg 


gttggtgggt 


ccctagagtc 


acactttatt 


tttaqctaaq 


gtqqqqtcac 


17761 


ttctcaccaa 


catttgcaag 


tgttcgataa 




atctaaaatc 


aaatqaqtta 


17821 


tagtgaattc 


tgagggagat 


gggcaggtgt 


tttataatat 


caaatattqt 


ttqaaattaq 


17881 










qtaaqqaqca 


taqccaaaaa 


17941 


tataaaggca 


tagttccacc 


tagtttgaat 


tggatgagga 


ttaaa aaat 


tat tatttq 


18001 


aaagtccctt 






aacaaqtatt 


atttcaccaa 


aqcctttcaq 


18061 


ccctfcagcca 


aatggaattt 


cttccttttt 


tttttaagta 


tagcttagtg 


attaaaaata 


18121 


gggtttcctg 


agcttcaaag 


acctgtgttc 


aaatcttggc 


cttgttgctt 


ataaactatg 


18181 


tcactttggc 


cagatgactt 


aaaccctatg 








18241 


aataatagta 


tgtacctagt 


agagt:t:gt^ca 


'tgaggaca'tti 


ggttataata 


acaacaacta 


18301 


acattagcaa 


gcagttattc 


tatgtcaggt 




gtgctttata 


tgcaaatata 


18361 


tgtattaaat 


gtatgtaata 


gatgtttcac 


ctcatttaat 




tqqtctqtat 


18421 


taactagagt 


gggggctcaa 




caaLgccttt 


ctattgcaac 


ctqaaqaqqq 


18481 


agattagaaa 


tgtagaaacc 


atccattcat 


atctctccct 


catttaataq 


ctqaqqacac 


18541 


taaggcccag 


agagtataat 


catttggcca 


agggtacaca 




aqqtqccaqa 


18601 


gccagggcag 


agtctcccag 




aagggtcgtt 


tctgtggagc 


caqqaqqtqa 


18661 


cattgggatg 


aagggaaggc 


tttccctcat 


ccatagggtt 


ctcaggtcaa 


acccacccaa 


18721 


accagccagg 


ttgtgggcct 


cccaaaaaac 




ggtgtgaaac 


ctttaataac 


18781 




aacctgggga 




aagctaacgg 






18841 


tagtgtagga 


tttgactgca 


tgtggtttga 


gggacatgca 


caggcctctt 


gtcaactgga 


18901 


ttccacatac 


taaaggagtg 


tgctgggcag 


cccccgactc 


acaaacgggg 


ccatcagcct 


18961 




ggctttgttt 


cagaacttgg 






caaggtggta 


19021 


acctgtgatt 


tggttctccc 


gccaacccac 


aaagccctgt 


ttaaccctag 


cagacctggg 


19081 


ccttcttatt 


aatggtggtt 


tcttattgtt 


tctttggaaa 


aatgcaatcc 


cagttccaac 


19141 


atgaaatccc 


gaatctggag 


aacttggcat 


caggctgtga 


atgaggcccc 


tgcctcttga 


19201 


aaataattct 


ctcccctctg 


gccattccgg 


ccaggggtcg 


gaggttcccc 


acacctctcc 


19261 


caggtccctt 


gatcagggca 


ctcagcggct 


gggaaacaag 


ggttggcata 


agtcagggat 


19321 


ccacttggac 


ctcaggaagg 


ccactgggcc 


agggaggaat 


aggccgggga 


gccacccacc 


19381 


tgagggaggc 


Rtgcaggaac 


ttagacactt 


ctctatttaa 


tttttatatt 


tcttctgtat 


19441 


tatctcccca 


acttactaat 


tttggaattg 


gaagcaagtt 


agtgaacttt 


tctgaacatc 


19501 


aggtttgtaa 


tttgttagaa 


gcgaaaatgc 


tctttatccc 


ttgggtggtt 


gtatagattc 


19561 


aaagggtaat 


agatagcaca 


ggagcagggc 


tcaggtctgg 


caaacagtga 


gtgactagta 


19621 


gaaaacctgt 


cactttctgt 


ccttaatttc 


catggggcta 


tgtgacatag 


tcttcttcag 
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19681 tcacgtagat aattggaagc tgtcagccta tcattagcat agaggctaaa gcacaaactg 

19741 tggctgggtc tttgtccctg ctctgccagg acatggctac tacttcactt ccctgagtct 

19801 cagtttcctc gtcataaaat ggggataata attatagaat tattgtgagg aacaaatgaa 

19861 tacatatgtc aagtgcttag gtcagtgcat ggcactgatt ttgccattat gattattctg 

19921 tatctttaga aaagagagtc agtgttataa ggcacagaaa gactaaattc ctagaaatgc 

19981 tgtgacatta ataaaataac atcagtatcc agcaagcttg ggtgtacatg ctttcagtgt 

20041 gtgtacatgt tatttatatc ctttcggcag tcttgggagg atggtagcat aggtggtatt 

20101 cattattcca tattgcatgg ggggaaacag gctggatggt gaaatgactt tccttcggtt 

20161 atgctgctag tcagcagttg agacaagacc acatcttctg ggttttcatg ctgtttttat 

20221 tcctcaattc aacggtgttc ttaggtttga tttaatttct gctggggccg ggtgcaatgg 

20281 ctcatacctg taatcctagc aatttgggag gccaaggtgg gtaaatcact tgaggccagg 

20341 agtttgagac cagcctggcc aatatggtga gacccgtctc tactaaaaat acaaaaatta 

20401 gctgggtgtg gtggtgggca cctgtaatcc aagctactca ggaggctagg gcaagagaat 

20461 cgcttgaact gggaggcgga ggttgcagtg agctgagatc acgccattgc actccagcct 

20521 gggtgacaga atgagactct gtctccaaaa taataataat aataataata ataataataa 

20581 taataatgac aacaataatt tctactggga cagaggagtc ttgaggggac tgagtctcgc 

20641 agagaagagt gtgttcaggt ttctcacagt agagaatggg gaaccctatt cctgtgccac 

20701 gttgtgtatt caggtgaagg gtgctggggc cagctcttgt tttggcctcc agattggagg 

20761 ggtgaggaag tggtggctgg ggcctcagag gagggcctgg ggtgcctgtc ttcagtctct 

20821 acggaagtct tgtcattgat ttaaaagctc ttccaagtta aaaaacaagc aaacaaataa 

20881 aaaaccattg acagtctctt catgttcttg gcaaagaaca atttatgaat aattttatgt 

20941 tcttgacatg ctctgaagat gaatagcaaa cacttctacc agaagtgaag aaatggctac 

21001 taggaggctg agctttagga ataaatatat agtagattca attgttatct tttttttccc 

21061 tcactctttt actggcatcc agcatttatt ttgaacttca gtcagaagtt atttcactaa 

21121 gaaccttgtc atagcttcaa catctccctg aggctactga tatggtaact cggaaggggg 

21181 aataatgttt gtagggccag actcatgtct gggtccaggg tctttcagaa tgttcctctt 

21241 tcagttctca gaacaccctt gtgaggtcgg gagtatcatc gttactgaga agggcactta 

21301 tcacaagtta ggcagtcaga gtatatttgt aagccagtag taagacagag ccgaaattgg 

21361 agcccaaaca gttgttgaag aagtgcagga tcctcctcta ttttagtgtc actgaatatt 

21421 tagagctaga ggaagagacc agagaaggag attttgcaga tgagaatcaa gcctatggca 

21481 tttctacaga tggtcctcgg ggtgacatag agattttaga gttcatcccc aagtaggacc 

21541 cggattcctg actaatgtga tttcccccac tgatctgaaa gtgagctcag agaccccaga 

21601 tatatgcatg gggagttcca gccctaatcc tcacttgttt gtgatggcca gctatcagct 

21661 ccatttctaa gaattgtgtt ttaaagcaag tcacaattac tRgttttaaa aattcttcaa 

21721 ggtatctgtt attaaaaaat atagacacac atttgggttt tcagctttgg tataaacttt 

21781 tcttgttaga ctttgtaaca catgacctaa ttgaacagtt tactgatatt ttctataaac 

21841 agtttactga tatttacttt ccctccagtg agaaatgaac gaatttcatc gagtgcaaga 

21901 ccaagtttac tttctagaac attacctata gaatacaaat gtttacattc caaagactgt 

21961 tttattatgt aataagaatc atcaatttga catataaaaa tatgttttat tttgtaatag 

22021 agatctttct atacattaat tacctctaca attttataga tgaaataggg cagcacaaga 

22081 ttatggctaa aaaacagatt ttttttgagt gcatgcaaat caaatccaac ttaaaggaat 

22141 ctgtgcaatt gacggcaaat agatctctct ttgtgcagtt cccttggagg tgaaaatgat 

22201 aatctctgca tgcaatttaa ggcaattttg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 

22261 tgttttcctg gtgttttaga aagcctggat aaatccaagt taattttagt tgtagcagtc 

22321 attccatcgg ttataggctt gtaatcaact cttagctgtg gtttgttcag atatttacaa 

22381 tgaatgcttt cttgtatcta tacagtactg taaaaggtac tgtgctataa Stttataagt 

22441 acaaatgtgc ctaaatctga tatattttat aaaattcagt tggcagctgt gaaggcttgc 

22501 ttttgtctct tcctcccctg cacaggctaa gtccttggaa tcagtatgtt aatgaagact 

22561 tggtatctca atatgaaatt tgcaccatgg gagttctgag ataatgcgtc ctgatagagc 

22621 ccctaatgcc tgcctcttag gagggcgagt gacaaagaca gtataactca gctaagtcct 

22681 ttcataatca ggcagccttg gtctttctca agtggttcca tagtttctgt gtgtgtgctt 

22741 tcttgtgttt ctgtgtctca gacctcttct ctccttgggc agtctgtctg taatctttta 

22801 atactgcatg gtagattgga ttggatgagc cccatcacct ttggataaga aaaggctttt 

22861 ggacacaagt ccatgcaaca Ytcttttctt tagagaacac aatagatcac tgggaacatc 

22921 ataagaggtg ctctcagagg gtgcttataa gtggcaccat cctaaatcca ggtttaccgg 

22981 ggacagtctg ggtttataac tcatcctggc ctaatgaata atagcaccct cttttagcac 

23041 agcatgattt agataagtta tatggtcacc ccatgcacat gggatgtggt agataacacc 

23101 ttttaagaat aaagtgaaat ggattcaagt gaagtgactt acctacccat gtctagttgt 

23161 gtcagattga atccagaatt ggtccaggtt atcctggtca tttcaccagc tacccatatt 

23221 gctgtccttt gaaggaatta caatgcttgg ctggttctct tttttaactt tcatgccaag 

23281 aaaccctctt agtaaagtct tagacatttg ttgtttattt tagttggtaa aatacagcat 

23341 taatttaaaa attacaaaat attgcataca taggaaggag tccgtaccct atgtagacat 

23401 gatttaagta atgatgataa aacagacata gtgacagaag catatagaaa ggtgatcacc 

23461 tgggaagggg tgcagagagg atggaaacat tctatgtctt agaatggtga gtacataggt 
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23521 aggtgtatgt atttgtcaga actcagtgaa ctctgcactt aaaatgggtg cattttactg 

23581 tatgtaaatc ataccttgat acagtagact aaaaatttac ctatttacca ctatccaact 

23641 taaaaaggag aacatttcca ataccttttt agctttctgc ctggtgccga ttcctgactg 

23701 acaagctgtt ttaaaacaca gaataactga ggctttattg ctaaatcaat gcctcctatt 

23761 agaagactga tatttctgtt ttagtggacg ttctttaatc tttattttcc ttatttcgaa 

23821 acgtcttgtt aaccctcttt acgtagtctt actcataaaa tattctaatt ctgtatttaa 

23881 caagcggcat tttattttag tgatagaaaa aactgtggct gaaccaagtt ttctgggatc 

23941 actggaatga tgttagctaa tggagtccaa tgttgaaagc atttgttgcg cacctactgg 

24001 atgtcagaca ctttgcagac acgatcttat ttctcctctg agcctaataa taggtatcat 

24061 ttcccttaaa tttcctcatg tgtagaattg aggcctatag atgttaagaa cttgcccaag 

24121 atccccatgc aaaagtaatt aggagaatta ggagtcaggg tttgagcccc agggatctaa 

24181 gtacattgtt aatgtagaga actgcccttt gctgggagaa gttaagcact tggctggctc 

24241 ccctggttgt atcgtgtagc accaggacca tgtggcgtat cacttccaaa tttcctaaat 

24301 cactgtgatc ctgcatactc cttgcccaga cccctcctgt ggctctctcc tgcttgccca 

24361 ccgaggctca gctgcctagc agcagatcca cagggcctct ggtatctggc gccaacccgc 

24421 ttatcctgct gggttttcct ctagctcccc tcaagggcct gtgtggctcc agctgccttt 

24481 tccattagtg ttcccatatg ttccaggtta aggcctccgc tttgccttcg attgtacctt 

24541 ttctgtcttc attgcacttg ccaaataaaa taaacaaaac aaaacaatga cagaccccag 

24601 ctctccagtc cctgtcaagc agtcatctca aaattttaat tcctgcctct tctaggatgc 

24661 cccaaatgcc agcctccctc tctgcaactc tactcatctt ttcatctaaa acagagttta 

24721 taatgattga gttggtcctt aattgttctt taattccatg tcttttttct ctaataagaa 

24781 taagaatgat aatagcttca tttattgagt cagatattgt gttctccact ttatgtggct 

24841 tccctcagct aactcccaga gcaatcttgg gaggtaagta cacattttct tcccatctga 

24901 cagactggaa aattgaggct cagagagaaa tgcctgagat cacatggcta ctaagtggca 

24961 agactgggat tcggtttgac ttgagtttga ctactctttg ctgacttctt gcaaatgcct 

25021 tgaggagagt agaaaagaaa ggaaactgac attgactgtg cctttgctat ggcttggcat 

25081 cgagatggtt tagcttatta tctcacaggg tccttatcct gtgctaagaa tcagatctcc 

25141 aaaaaaattg agaattagat ctctaaaaaa cggaggctct gtttacataa ttcattcaag 

25201 gtcacagcgc tgattagtgg agagataaaa attaaaccag gtctttctgc ccctagagat 

25261 gctccactgc gtgcatgagg acactttatt tctttttctt actctctgca tggcctagca 

25321 tagagctgga tattagggaa acttttcaat tagttatagg ttgcttaaaa aacaacaaaa 

25381 acaaaagttc cttttattct gttagaagca tgagcatatc acacatattc cggtagggaa 

25441 gagtagtacc ataaagaatg ttggattttt agaagagaca acgtaaaagc aaaggtgagg 

25501 tgagatagac acttgaacag aatggcagtc acagaatttc ttatcttttg agagatttac 

25561 ttgagggatt ctacctttac tatagtagag gagtgtctaa aattaaaaga caggagaaga 

25621 tgacttagta ttcccattca gatttttaaa ttgctgtgtg gttagctgtc tccagtcagg 

25681 cagcttcact gctcctttct ttaactgcct ttctcaggca tagaagtaaa aagaagtaaa 

2 5741 aggtaacggc cagttgccct cggcctctcc attggtggat ttgcagagtg ctgaaattca 

25801 ccctagagtt tacccttcaa agtaactagg cattaaaaaa ctcataccta caaaaaagga 

25861 actcctttgt gttccaacaa ttgttacaat atgttagact tttggtgatc aaaacacacc 

25921 tccattaact gctccagcat cttgtgcatc tcagctgttt gtgcgagtca tccagcatga 

25981 gggaggatgg gtctgcatca ggtgcagttc ctgggccttg gaatatgtcc tccacttttg 

2 6041 tttgatcagc ggcgtgtttc cattgagtgt gctgtgtgcc aagagaacag ttgtcagtgg 

26101 tattcgcagt gatgcacatt agcatgagtg ccagagattc aagtggcgac gagtaccaac 

26161 tctgccttcc acccattgct ttttgtgctc tcagaaactg gagaaacgct agctttgcac 

2 6221 actgggtttt ctcgaagcgg gctaggcgta tcatttccaa gggctgtgga gcttcctgga 

26281 actgtgcatg ctgtgaacac tctcccagaa tgatgaaatg ccaaagtgct caagtcacca 

26341 tgcctttgtt cactggctcc catgcctttc ttgctgtgac catccatgag gcttagaacc 

2 6401 atcacagctg tttaggaaat ggtcttggag tccaacagac caaggtcaaa ttctggctct 

26461 gtcacttagc agttgggtga ctgtaggcaa ttcatgtacc tctctgaact tctcatttct 

26521 catctgtaaa gtaagtgaga gtaataacac ttatggggtt gctgtgatga ctaaaataac 

26581 aaattctaag tatgtggcac ttggggtatg tgtgtgtgag agagagagag aaagagcggg 

2 6641 agggagggag ggagagagag agagagacag gcaggcagtg gctgtttttt tgaacggcga 

2 6701 tgcccacaga gggggagtta tttttggaga ggagggggtc ttctcactgc tcccaattct 

2 67 61 catccatgct ggggacgtgc atctgtctct gccactactc tgggaacata tctcttgtcg 

26821 gatacagctg tctttctgtt gttcccttgg tctgccagca cagggctgag gccattccct 

2 6881 gcactcctct tcctacttgt acccccactg caggccatgc ctggggactc cctgaccctg 

2 6941 ctgtccttct cagcttggag cttgctgctc tttatgggga ggactttggt tctctttgcc 

27001 aggctgcctg agaaacaggc ccaggtggtg tgaagtaata atccccagac caaatatgtt 

27061 tatcgtctta tttgagccat cctgagactt actgcaacac tactttaagc ttagtgagtt 

27121 taatttggat cacagcctat tatacatatg gcatggtgtc agcttttagt aaaagataaa 

27181 ataggtgcgt gaaatgaaag agacatagat gccagttgga gaacagagtg atctaaaagc 

27241 aaattctgca tagcaatatg aatacttcag agtggacagg ggtatagttt gaaagaagct 

27301 acctataacg actgttttca ttatattgac taaagaaaaa aaaagctcga caaaatctcc 
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27361 ctaactcgtg gagtcatatg gaagatagaa taagaagaca gtgcacaatt gtgggcaaaa 

27421 gaaggtaatg aaacattgca aagactttaa aaagcttttt ttaagttcta tattcacagc 

27481 cacatttcgt aaacttaaga ggtgctgcat tttttttcat tcattcatcc atccagcaaa 

27541 tatttattat ctagtgtgag taaacttgcc catgagccca tccctttcca ctgcctgtta 

27 601 ctgtatgtga caaggatagt agcgtgctca gtagcttgca accacgttgc ctgggtgagc 

27661 gactcctcca tctactggcc cagtgacctt gggcaagttc cataacctct ctatgctcta 

27721 gtttcctcat ctgtaaaaga gaaWtgataa tagcacctgt cacagagggc tttggggaat 

27781 tgaaaatgca aggagtgcag agcaatgcct ggcatctgag agacattcaa tacatgttaa 

27841 ctcttatcat ttgttttctg acttattcat ttgcttcttt attgtcactc tctctttcca 

27 901 tcattaaacg ctgagagaaa ccttgttttc tcttttctca tgtaactact atccttggtt 

27961 cctaggacag tgtctgccca tggtagcagc ttagtaaata tttgagccga gtaagtgaat 

28021 aaatggctgg atgaatgaat ggaatgcatg aaggctggac tctgtgctgg gtgcttccta 

28081 gacaggagtg aacaaaactt ggtgcctgct gggtatatgc ccaaaggaaa gaaaatcagt 

28141 acagcaaaga gatatctgca cccccatgtt tgttgcagca ctgttgacaa tagccaagat 

28201 ttgaaagcat cctaagtgtc catcaacaga tgaatggata aagaaaatgt ggtacatatg 

28261 cacaatggag tactattcag ccataaaaag aatgagatcc tgtcatttgc aaccacatgg 

28321 atagaactgg acatcattat attaagtgaa ataagccggg cacagaaaga caaacttcgc 

28381 atgttctcac ttatttgtag gagctaaaaa ttaaaacagt tgaactcatg gacatagaca 

28441 tagagagtag aatgatggtt accagaggat gggaagtgga gggatgggaa ggaggtgggg 

28501 atggttaata ggtatgaaaa atagaatgaa tgaaggaaca agatctagta tatgatagca 

28561 caacagggtg actatagtca aatatatata tatatatata tatttttttt ttttttgaga 

28621 tggagtctta ctctattccc cagactggag cgcagtggca tgatctcagc tcactgcaac 

28681 ctccacctgc ctcgttcaag cctcccaagt agctgggatt acaggcacac accaccatgc 

28741 ccggctaatt tttgtatttt tagtagagac ggggtttcgc catgttggcc aggctggtct 

28801 cgaactcctg acctcacctc aggtgatccg cctgcctcgg cctcccaaag tgctggaatt 

28861 acaggccact gtgcctgacc atatagtcaa taataattta attgtaattt aaatataatt 

28921 ttacagtaac ctaaagagta taattatcct ttgtaacaca aaggataaat gcttgaaatg 

28981 atggataccc atttatcctg atgtgattat tgcatgcctg tatgaaagta tctcatgtac 

29041 cccataaata tatataccta ctatgtaccc ccagaaatta aaaattaaaa aaaattaaca 

29101 ttaaaaaata caaaacaagt ggtgttgcct gcattttaga gcttccagct agtcaagtga 

29161 aggagaagtc tagagatcta taataaccaa aggtgctaag tgtcaatgca agggaagaac 

29221 qqqgacacca tgctgtatgg catcaaggaa gaagtggcat tttccgacta tgttgagtgg 

29281 tggactgata aacaagagca ctttttttct ttatccatta gaagacttaa aataattatt 

29341 tgtagatccc acttccccac tttttaaagc atttccttct caattattcc ctctcttcaa 

29401 acatacaaac ttatacgcat attctaaaac ttgtaaaagg aagttaaaca cattttaatc 

29461 atttaaaagg ttttttaaaa agtcaaccca tgtaacttaa aaatgattat gaagcacttg 

29521 tttactttcg gttgtctggc atttatttgt gtgtggggtt tttttttttt tttttttttt 

29581 ttggtgtatg ggctatttat tgctttttct taattttttc cttgtgtgtt tggaaagcaa 

29641 aatccaaaca aagtcctttt tggcagtggt aataataaat attctgttta gcctttggat 

29701 gaggtttaca tgatgtatct ataagcacat tctggaatat tggaataaca gtttccaatc 

29761 cttttgatca atggatagtg ggataaatga aaagttccaa ccaaaatata cctactggac 

29821 aagaaccatc atcttttaat aggcagaaat cctctttagg gtccagcctt cctttcattc 

29881 ctgttctaca ttgtcaatga ctctgaccac aggaacagag aactgctgga tagggtgcag 

29941 tgctgctggt cccctctatc atcaccagca caaataattg tattaatttt ttttttagga 

30001 gtggagttta agctcagctg tgcaccccag acattgatga agatagtcta tcgtatgact 

30061 tagagtcaca gaaccacaag atgcgtgtgt gattatctaa ataaagagct cttactgcaa 

30121 ccttgtgact ctggtgatag aactgggcgt gggagggctt tctcattcat gtgctgtgca 

30181 cacaaatgat tctctaagtt gaaggacaca ttgctctgca gcataactga aaaatctgtg 

30241 catgaatgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg ttgtgggggg gatgtggtct 

30301 cattcctact tgcaaaaatc ctccagaagg ggttttcaaa gcattgcaca gctgttgctt 

30361 gtcagcttct tgtgttgact agatgtcagc aacacttcct caactttgga caatgacttg 

30421 aaagcaagat cagtgtgtgc tcatgggatg gatttaaatg gaattgaatg atacagaaac 

30481 cattgcctaa tttatattta ggtaagtcat gttttctttc tctctctctt taaatttttc 

30541 ctgaaatgca tgtatgaagc tctttttggc taaaatgaga gaatgaagca ggctcctgag 

30601 tccttgagtg aagaggatcc attgagaagt ccagagggag agagaatgag cttgattaac 

30661 ggcagcacag gattggcagg agtgatataa aatagtacat catctcagag tgacctaata 

30721 ctcgactttg gaatgcattt ttatttttgg cagtggattt acctccctgt gtcctgcctg 

30781 gacaaacgtt ggaattagtc agcattcctt aagcacatgc tgattatggc caaagaaagt 

30841 ggcaggcttg aggcttgctg gcgaccagga gaaagccagc tggattagac atttgccgtg 

30901 agccttttgt tcatgtaact tatggggcag tctcctaagc cagagaaaca agaaactcaa 

30961 aggcagtttt agcaaggttc tcatgcatgg cagcatattt taatactaat gctcccttta 

31021 ggcaggccct aaataagaat gtcttgttct aatttttttc cgtagactta agattgttgg 

31081 gaaaaaactt gttagaattc ccaagtgaaa ccatttaaat gccttggctt ctgataactg 

31141 ggagtcctga gtgaagttgt cagttgctgc tgaactcagc agaaataatt gcgctcccta 
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31201 tgccaatgat ctgtgcatat tttgagacca ttctgagacc aaggaggcta tcacaacatg 

31261 ttcacattgc tatcgttgat tagagccctg agctttacat gttgtctcct gtggtgtcct 

31321 tcggaatatt ggagagccat gcctctttcc tttcaatgtt ctcaacctgg cccagccagg 

31381 attagaggct cccttctgag gttcagtaga attttatatt taccctttat cattattcac 

314 41 cttactgtat tgcacttctc tgtctacagt ctccagcatt ccattacagt gctaggacta 

31501 tgttggcatt tgtattcgtt tttcattact tcataagaaa ttgccccaaa tttggtggct 

31561 taaaatgatg cacatttatg atggcacagt ttctatggat caggagtcca gacacagttt 

31621 ggctggattc tctgccctag gtctcacaag gctgtactca aggaactggc tggggcttct 

31681 ctcatctgaa gcttggagtc cttttccaag ctttcgaggt tgttggcagg attcggtccc 

31741 ttgtgattgt ggaactcatg gtggcctttt ttctttcagg ccagcaggaa aaggtctctg 

31801 acttgagggt gggccccagt ccttctttta ttttattatt attttttgag acagagtttc 

31861 gttcttgttg cccaggctga gtgcaacctc ggtgccatct cggctcactg caacctccac 

31921 ctccagagtt caaatgatcc tcccacctca gcctcccaag tagctaggat tacaggcaca 

31981 cgccaccatg cctggctaat ttttgtatct tttagtagag atgggatttc accatgttgg 

32041 ccaggctggt ctcgaactcc tgacctcaag tgatccaccc acctcggcct cccaaagtgc 

32101 tgggattaca ggtatgagcc actgcacctg gcccccagtt cttcttttaa aggcatcacc 

32161 taattaggtc aggcccactc aacagttgat taaccttaat tatatttgca aaatcccttt 

32221 gccatttaag atgacacaat aattggaatg atatcccatc atattcacag gtcctaccta 

32281 tactcaaaag gtggtaggtt atacagggtg gatatagtgt tgaaagaagg aatgaatgaa 

32341 atgaatgaat tctgatagct ggaagcacag caactagctt tttttgtagc atgagtgaga 

32401 gaaggtattg agttccttat tgaagaatgc agttgcaaca gtttatcact agtgagctgc 

324 61 ttcttggtaa ctataagaca taagcagaca agaaacagca agaaaggaag gaaggagaaa 

32521 agagaacgag aaggaaatca ttattttgga ccgatttccg gtgRtggttt caagaagttc 

32581 tccccaccct ccagactgct gtcatttaca attgctagag aacaagagct gggagcctct 

32641 ccttggttct gctcctgtgg catagccatg ggggatgcag aagggcagtg gtgcccaMta 

32701 aagggaatcc ccgtcagctt tggttttagt ttccagagat tgagcctctg ccttaggctg 

32761 agagaaaaac ttagtgtgag tctttatttg ctcccttata atcacaggca catgtggcac 

32821 ctcctccccc atctctgccc cacctcttcc aacagtgaga tgcattaggt gtggaaagga 

32881 agccttcatt tcaaaatctc ctggctgcgg ttagctttaa aaatcaaatg gaactgtggt 

32941 cgtctgtgtt taatttggaa gagcagaaag gatgcgtgct ttgtggcgcc tgcatagtga 

33001 aagtctccat gacttaatac cactgcccag ccttagaagc ttcccatgca gacaatggcc 

33061 ctgcttcccc atcagttctt gggcttgggt gccaccttat ataMacattg ctgcttttag 

33121 tgcatcgcag aagcaaaagg atgacacatt aatagcagct ggaggatctc attaaaattc 

33181 Rgaggaaaat atgaaggctg gagcctggta atgtacagac agtaagtctt tgtctgtctg 

33241 attaacaatt gcatgYattc taagtgcctt gctggatggt aagaaccagc atttcagcca 

33301 gaagtaccat gttcttctcc ttaagaataa cctatttctg tttttcaatt cccttatccc 

33361 ctctaaaaat aatatgctgg tcttttggac tcccttgaga actgaaacag cttagcaact 

33421 ggggcagtcc ctgagagctg gtgcagaaac aactgttgct agcttggatt tgaattatgc 

33481 agtcagatgc ttacatcgtg ggagggggat gtacggggtg tgtggccttc agatgcttac 

33541 atYgtgggag ggggatgtac agggtgtgtg gccttcatgc cagaaaggac agggaagcac 

33601 tccctggttg gcgggaggag gggagatgct ggggttgagg cccttacctt tatttatgtt 

33661 gttcactggt acagccagac tgcaacctct caactcttca gacaagacct aaggccgaca 

33721 aaattcactc cctcatgaag aacattgggc tgtttctccc tggtgtgtag gtggtgggag 

33781 tgaagtagag gggctcgctg tgtgcaggga aagggctgca ttctcaagaa tttgtatgta 

33841 gcccttggac taaaaacaaa atattccaaa ggaatgcttc ttttggaacc ctggtccagt 

33901 ccctcgtcta agcaggagtg cccttatatg cttgactgtg gatggcatct gattacgggc 

33961 agaaggttgc actggattgc tgcctgatgc ggacccacag aacacagttt aactctggct 

34021 ctgtcagcag tcatcagtgt ttatttttcc aactcttcag agagatatta gtcgttatga 

34081 aaagtgtacc atctcagaac tccatggaat aagctgccaa tataagacag gttgaaatta 

34141 ttgtgtcttt ttgcctagag ttatttttag ttattttttc ttccttggaa tttcttttct 

34201 tttttttttt taaccatata tgctaacttt gtagctggtg gcaaaaaaag attattttcc 

34261 ctatattaat gacttcttca gctgaattga ggaatagggt taccagctct tttcttggag 

34321 tttaacactc tcaatctgat aatatgaata acacctcaat aaccacaaca agagcggatc 

34381 atcttcagta tctttcctga attacctttt ggcttttcac caccttgttt gtccgtttca 

34441 gagtttgtct ttctgtaaaa tcatgggaag gcatctgatt actttgttct tgatgtacct 

34501 gagtaatgcg aacatctgaa taattgaaca aaaatcttgt ttaaacaaaa cttttgtttc 

34561 agaagcatct gtcaaattgt tagtcatctg acaatttaaa atgtcaaagc aagagttggc 

34621 tgatcttaac cttagccttg tcacattatt aacttcatgg agcttcagag taacttgaga 

34681 agacttgcag tctgtacctg aaaaagccag gtttagaaag tggctcatga agaagtaaaa 

34741 cctctcagtc aagtggaatg tgcaagggaa cccctaggag cattttggca tgaacatttc 

34801 ttttgagttt ctagcccaag acctctttat aacatttgtt cctctcccag gtcctcctcc 

34861 tactgcctct aatttttaaa taagttctgt ttaactcatt tttggataat aagctagctc 

34921 tgagaaagct tggtagaact attttcactg aaggagttct aaaactgatc atatctcttg 

34 981 tcatcaaatt ataacctgac tgttgcctgt tatctgaaat tctaagatag tgaagagagt 
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35041 tattttactt tttcttgtag ggcccttctg tcattacttc cttgctagaa tcctatgtgt 

35101 gataatgagg gaatgaaatt atccaaaggg taatttgcag tatgcatcct acccagcagt 

35161 tgaaactact gcagcgaaac tgcaaggaaa cctaacccat gttttcttgt aatgctaccc 

35221 tttgggggtc atgcatgctg ctttctgctc caacagtctt aacgtgggaa ttggggacac 

35281 ttactttttg acctggaaca aatctctaag actcgtaagc caccaattga ttcacctttt 

35341 gtagcagggc ttgtatgaca aactcttttg tgagaacaaa aggaaagtcc cttccctttt 

35401 tgccattctt gttctgtaga ttaattagca aagacgtagt ggagaaatgt gtcacttgca 

35461 agacatttct ggcacggagt ggaccaagtg gaaatgctta aatgtgttgc agggatatca 

35521 aagggactga gcttttggaa caaagaggaa cagagcccaa aggacaRttg gtacaaatta 

35581 cacaattttc agttggttct aataaagttc taggaaaagg ggcttatgca caaaggctga 

35641 gctcttgcct ctgttgttct tcaaatccaa aatgtcatta aaatactagt agtggttgtc 

35701 atattttaat aacattttgg atttgaagtg tgtagctgaa acttactctt tgttaatgga 

35761 tgtatcactg ggtgagcttc tagactgcct caccaatgga tctagcatta ccttggttaa 

35821 tgaaataagg taaaacacag tccctgtgta ccagaaagca ggttcatgtg tcagctgcat 

35881 catgtcattt ggaactttgc aggtgtggtt ttcctttgct ccttcaatct cctcatttct 

35941 ttcattgcct ggatgaagcc cctgcaacag gagagctttt gcctcctctg cctcacctca 

36001 gccccctaga cctctgtgtg caactggagt atggtgttaa gtgagcctgg agcagtctta 

36061 ggaaaaggct tcaagtgagg ctaggtggga aggcctggga cagaaataaa ttcaaaacaa 

36121 caaccaaaga actgataggt ggaataaaag aagtgggaca gaaaagattc aaaggtggga 

36181 attgaaagag agaggtacag taagaatagc tggcgtccag tgaacatgta cagtggacca 

36241 tgcacagttc caagcacgtt cgatttatta tttcatctaa gactcacgag aactctataa 

36301 gacaaaggtc tgacattctc atccccgtgt tagatgtgag gaatctggag cacaaatgtg 

36361 taagtaattt ttctgttgag aaatggacct atcaggggtt gaacccaagc agtttgcctc 

36421 cagttaatgt ccttcgccac tttatattga actgagaaaa ggcgggacaa aagtcacaac 

36481 caaaggtggc tttggccttc agaatattcc acctactgcc atcaatattc caccttctca 

36541 agcccacatg ctcctgtttc tttttttgtc cccccattcc ataattaatc atccttgcaa 

36601 taaaagagac ttttggaaat tattttggga aaatgaagtc tttgggaggc caaggtgggt 

36661 ggatcacttg aggtcaggag ttttaggcca ccctggccaa tatggcgaaa ccccctttgt 

36721 actaaaaatt caaaaattag ccgggcgtgc tggtgtgtgc ctgtagtccc agctacttgg 

36781 gaggctgagg caggagaatc gcttgaacct gggaggcgga gtttgcagtg agccgaaatc 

36841 atgccacagc actccagcct gggcggcaga gagagactcc atctcaaaaa aaaaaaaaaa 

36901 aagaaagaaa gaaaatgaag ggtatagctg aaatctgctc atttcatgat cttgacaatt 

36961 taaaaaacct gattgctgtt gcagtaggac tgtttaaaaa aggaaagaga aaaatatatt 

37021 tttaaaaatc tgagcatgtt gaaagacaca aaggaccttt gttacaagtc actcctacca 

37 081 atggaagaac aaccgttgct ctcatctttt tttttttttt ttttttttgg tcatttcatc 

37141 tctccagtgg ctgagatcaa taagagatgc tctatggttg cttttgagtt cctttggatg 

37201 tcttaatgaa gggatgaaca tattgttttt tttttttgac acagggtctc attctgtcac 

37 2 61 tccctgctgg agtgcagtga cacaatctta gctcactgta gccttgacct cccgggctca 

37 321 ggtgatcctc ccaccttagc ctcctgagtg gctgggacca caggcacgca ccaccatgcc 

37381 cagctttttt tttttttttt ttttttttgt atttttagta gagacggggt ctcaacatgt 

37441 tgcccaggct ggtcttgaac ttctcagatc aagccatctg cttgcctcgg cctcccagag 

37501 tgctgggatt ataggtattc gccgctgtgc ccagccacat attgcttttg atgtagttaa 

37561 acagttaact gctaccaagc tcttcacaga ggttactggg ctcaggaata aaaaggcttc 

37 621 tactccaaat ctggtagcac cttgcaccac aaagacttgc tctctcagta agaaaacaaa 

37 681 tggcagttct atatatagtc aagagtggag gcaggaaaag actacatttc tgaaaaatgt 

37741 gttcccaaga aatccatgtt aatcaatgtc agagaccatc cagtttttat atattgcagt 

37801 ctcaagaaat attctgccac agattctcta aggactgatc ataaaacaca aaaccttttt 

37861 atcttttgta agtttgtcag ttctaacatt aaatgcttaa ttttgacagt taccaaaaca 

37 921 agtcgattct agctttcctt ttttaaagtt tcaggtgaaa tttgctataa tctcatgtcc 

37 981 atcctcattc aggcatgctg aaagcactaa tattctgttt gcttttgagt gctaaagaaa 

38041 tatagcccat gagattactg gccaaattga aaagaatgag gcttattgta ccaagatatc 

38101 tttcttgata aattaagtca atgatctatt tcctgcacta gaggtcttga ttgggtatgt 

38161 catagcgtaa ttttgtttgc cttctgaggc acacctattc tcttgctctt taacttggaa 

38221 agctcctaac tcagaagcca ctctgtgaac atatggctcc aaagattgct tcagattcct 

38281 ggctgcctgt gctaatgtca gtactcacct ttggacagga tatctaattt ttagattcta 

38341 gacagttgaa agtaactaac ctctctaggt ttttcttctg tttcctagag taaatgttat 

38401 caatgtgatt taatctgact agattactat acatttttga acgtaatcat ttatttgtgt 

38461 ttatattatt tgctcttgtt tcagctacgg aatttttcta tttttttttt ttttgagaca 

38521 gtgtgtcact gtgtccccca ggctggagtg cagtggcgca atctccgctc actgcaacct 

38581 ccgcctctca ggttcaagca attctcctgc ctcagcctcc caagtagctg ggattacagg 

38641 cgtgagccac tgcgcccagc caaggaattt tgctttcttt agtcagctag attttaaact 

38701 ccacgaggtc aggaaactta cttttcacac gtttttcctt cttatagaat aatattttct 

38761 cgatatgttt tagaggtggt attttatagg cagtggttaa agactgaaaa ctttaaccac 

38821 tttaaactct gaaattctag gcaatgggat ttgtactgaa agacatagga taattatgac 
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38881 acttcaatta tagtccgttc aattcaccac tggggtggtt ctaagtttag ttcatggtgg 

38941 ctttctccca ggaaatctaa agtactttac atatatagct ccattgtttc tcatagcatg 

39001 caaccttaac ccagtgtttt cacatgtgat ttgagagctc gatacccaaa ttatggtcca 

39061 tgcaccagca gcaccggcat cacctaggag ctggtgggaa atgcaggctc tcaggcctca 

39121 cccaaggcct catgaatcag agtctgtatt ttaacaagat ttcgggtgat ctgtgtgcac 

39181 attacagttt gaagagataa gacaccaaag ttaactattt ttcctgttat cacatggcaa 

39241 gtagatctcc agtcctgagt ctgggttgca tagcttgtga tgttgatgtt gcaaagatgc 

39301 aggtgccttc ttttcccatg atgatgcaga gccccctcag ctcctgttgg agcctgtgat 

39361 ggtctctaca ctccgtgttt ttcttcttag gctctttatc tgtacctgcg ttgcttttgt 

39421 cccttaaggt tttgtccctt tgatttcctc ttaagtgctc tgaacttaat cattttttgt 

39481 ttcttaattt taacatggcc atttcagtct gatgacagtt ttataatcaa aacccacatg 

39541 caatcgatta ataatgtttt ataaagcaac agttgctacg aaaaaaagtg gatcttcctg 

39601 agatgaggct atatgtaaaa tatggatgga atttgtatgc caattgggaa ctgatggaga 

39661 aatgtatttt ggtctgacat aaaagttcct ttctcaaact actttcatac aactaagcaa 

39721 tgaaagagca gagaattctc aacttgcttc actactggaa agtaaaggga aatttatggg 

39781 ggtaatagtg acattgagaa gtagttcttt aaagaactgc atttttcata ggtaactgga 

39841 aacatttcca gctttgaaaa tgaacagata tctttttctc tcatgtttag taaggcttta 

39901 gaaatgagta ggaacaattt ggtagaattt cttacctgta tttcaaaaca tgctgagtct 

39961 acttgttaag taagatgttt taaaagcatt tggtccttgg aattctgcta ctcataagaa 

40021 tgcatttcta gagccgggcg tggtggctca cgcctgtaat cccagcactt tgggaggccg 

40081 aggcRggcgg atcacgaggt caggagatca agaccattct cgctaacacg gtgaaaccac 

40141 gtctctacta aaaatacaaa aaattagcca ggcatagtgg tgggcgcctg tagtcccagc 

40201 tactcgtgaa tctgggaggc ggagctttca gtgagcagag atcatgccac tgcactccag 

40261 cctgggcgac agagcgagac tgcatctcaa aaaaaaaaaa aaaaaaacaa aaaaacgcat 

40321 ttctataacc tgcaccgatt ttgtagaaat tagtgtgatc tgaagctagt tgtcctgaat 

40381 atgctttatg taaatatgtg tggctcactc aagtaatttt cattctttag attacatctc 

40441 tcgatctttt tggatgtaac tggatttgct ggagttaaga aWgaacgttg tcattccccc 

40501 tgcccacccc caccttcctc ggctattgat gtaataaatg taattacggt actaagaaat 

40561 cctttccagc tgaaggaaca aaccacaaag tctgtgcttt aaatctttgt aattttttct 

40621 ttccgtttaa aatctaagcc atatattcag atttgggcgt gctttttgca ggtgtctatt 

40681 tttagagctt tccctgtttt gtaatttccc agttctcagt catgagtatt ggggaaaaat 

40741 gcacattaaa attggttttt acttttgttt tattttaaaa tttgttttga gttttataac 

40801 tttaaaagcc tcgtgcattc ttttcacagc ttccagcccc aggaacatgg caatggaaca 

40861 tgtttattgc cagtctagca tctagaaaaa gctaactcct ggatgacaca aagtaaatga 

40921 ttgcaaaaag aaagagagag aggaagaaag gatgacaggc aggggaaaga tcagggggtc 

40981 tgaccacaga ggtggcgcat tttactgaat taagcatgat agttgttgcc caggaaggtc 

41041 tcacagaggg ccacatgtgg ggtcagaggt tgcctgtcac atgtggttgg cagccagaaa 

41101 cagccatagc aaggctgagg ggagcatcat tcattttttc attcagttag gtttttgttt 

41161 ttgttttttg agacagggtc tccctctgtc actcaggctg gagtgcagag gtgcaatctt 

41221 agctcactgc agcttccact tccccaggct caagggatcc tcccacctca gcctcccgaa 

41281 tagctgggac tataggcaca cgccaccaac acccaactga ttgttgtatt tttagtagag 

41341 atggggtttc gccatgttgc ccaggctggt ctcaaactcc tgagctcaaa gcaatccgct 

41401 ggcctcagcc tctcaaagtg ctgggattac aggcgtgagc caccatgccc agcttcagtt 

41461 aatgtttatt gagttccctc cctgggttaa ggaatgatac ttcagcttta acagtgaggg 

41521 gtaggcgcca ggatcaggtc ctgggatgtg tggctgtatt tttctggcat gagcagcttt 

41581 ctcaggcatt ggttctttcc aagcaggaaa gtttattaga attttgcaca gatgcggttc 

41641 atttcaccca ggattttgtc ttagttttag cagcagagac agtgaatctc actgggtctt 

41701 aatcaggaaa aggagttggc cgttaaatga cctcaggagt ggcttctggc ttagaccctg 

41761 gagatgccgt gtggtttgct aaggatcaac tagctcatga caaatactag aagagactta 

41821 ataaatctgg acctttcttt accaatccaa gttgcttgag ttgtaaatga tgtaaaaata 

41881 gcaccctaca gacactagtt gttcagggtg aaatctctta tttccattag ccctgattct 

41941 agagaagagt ggagagttgg ccttaggtgc cctctttgtc tttgactgta tatgtggcac 

42001 actttcttca gccaatggaa tggcatacgt tgtatgccat ctgcaagagc ctgataagtt 

42061 gcattacaaa tagagtgtgc aaacaatagt actgattaag tgacaaatgt tgaggcctgg 

42121 gaactgattt ttggcactga caaattaagg cagattagac cgcctttcag gacacttttg 

42181 ataatgctac gtgtatgtga aagaagaagg tatcagaaaa aacttaatag agtttcttag 

42241 caagagtact ggaacataat tgtggatgcc taagaggaag tgtttgagtc aggaaaattt 

42301 cagttgtcct gaattgatga acgagtctct ttttagtaaa tgcttctttt aacggtcatg 

42361 gttagttagg acttagagac aaattacaga gtagcttatt gttatttctg ttgttgtata 

42421 ataaactttt ggagatgaac ttcattaaat gcccctggtc tggttttgtt ggtgaaaagc 

42481 tgaaataaat ctattgttgt taacatctgt ggtgatatag acttgaaatt ataatcttag 

42541 catgggttag aggagctgtg agaaacagtg aaaattcata tagcacttat tatacacagt 

42601 gtatgaaatt aataatctct gcatcctcct cccaattgct gaaattattc ttacggtttc 

42661 catttcattg tggtaccata gagtatgtaa gggatgctgg ggctaagact attagcatta 
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42721 tgggaatatt taaatataat ttaatatgag aaaaatcaga ggaatgtgca aactctaggc 

42781 tgtatttctc cctaatgggt aaatccagtc atatatctct gacagaattg taaaaacact 

42841 gagttatttg atctttcctg actgacttat cttggagtca tttatattta taacatgctg 

42901 tgcaccaaag catgaaaagc agaagcatac agtttcgctc tcatctgaag taataaaata 

42961 ttttttattt acatagtcaa tctgggatag attatagtag aaaaccccta caaatctgag 

43021 atactgaaag tggtagccgt tttcagagat aaataagtaa cctattagcc tgattagatg 

43081 tctttgctac tcacatggta aataaggcgt aactttgctt gtttactcct ttaaatttcc 

43141 ttatagatgt acttactttc tttttatatg tttaaggtct tagtagaatt tcatcccttc 

43201 ctccagcttt ttattgtata aaatgttaaa tgtggaaaaa ttgaagaatt tatgtagcat 

43261 agcacatctg tatatctact acctaaatta attaattaac atggtgctgt atttcgtttc 

43321 tctctctctg tctttttttt tttttttttt tttgagacag tcatctcgct ctgtcaccca 

43381 ggctggaatg cagtggcgcc atctccgttc actgcaaccc ccgcctctcg ggttcaagtg 

43441 attctcctgc cttagcctcc cgagtagctg ggattacagg cgcccgccac tacatccagc 

43501 taactttttt gtatatttag tagagacggg gttttgccat gttggccagg ctggtctcga 

43561 actcctgatc tcaggtgatc tgcctgcctt ggcctcccaa agtgttggga ttataggtgt 

43621 gagcaaccgt gcccagcctg catttgcttt ctctatttgt gtgtgtatat gtgtatgtga 

43681 aggtatgcgg atgtgtgtat atatatatat atgttttttg ctgaacagtt tgagattaat 

43741 cagggtaagg agatgtcatg atgcttaccc tctctctcta tatatttttt aaggcagggt 

43801 ctcactatgt cacccaggct ggagggcagt ggtgcaatca tagctcactg cagcctcgaa 

43861 cttctgggct caactaatcc tcccacctca gcctcccaag tagctggaac tataggcaca 

43921 tgccaccaca cctggctaat tttaatttta tttctataga gagaaatctt accatgttgc 

43981 ccaggctggt ctcagactca agtgatcctc ctgcctcagc ctcccaagta ttcagattat 

44041 aggcatgagc cactgtgcct ggcctaagtt cttgagcatc tgtttcctaa tagtaagggc 

4 4101 attctcgtac ataaagacac taacatttca aaatatgaaa aatgaacgat aattttctca 

44161 ttatatctgg tatccagtcc atatttaaat ttatctagtt gtctccaaat tgtcttttat 

44221 ggcgattttt tcttcctaaa ttcagataca cagtgctcat gcattgcaat tcatattgta 

44281 gtcctaaact cttttatgtt taatctagca tctagaacag ttctctcact tttttttttt 

44341 tttttttttt tttgagatgg cgttttgctc ttgtcaccca ggctggaggg caatggtgct 

44401 atattggctc actgtaacct ccgcctcctg ggttcaagtg attctcctgt ctcagccttc 

44461 cgagtaggtg ggattacttg tgcccaccaa cacgcccagc taatttttgt atttttagta 

44 521 gagacagggt tttaccacat tggttaggct ggtctcgaac tcctgacctc aggtgatcca 

44581 atggccttgg cttcacaaag tgctgggatt acaagtgtaa gccactgtgc ctgggatttt 

44 641 tttttttttt ttttgacatt gactttagaa agagaccagg ccacttgcct tggaggacat 

44701 cccacatgga ggatgtgtct gattgtttcc tgttgaggct gttcagtttc tctcattcat 

44761 gatttcctgt aaactggaag ttaggccaga agtttggagt ttattagttc agttaatctt 

44 821 ttttgggaag atgaggtaga ggagattaag aaaaagtcag tgtcatcttc gcagagtgat 

44 881 ttttcttaaa accctggcct ctgtgatatt gagaaagtca caatattgga aggtgacact 

44 941 tgtcaacttc tgattagagt ctcttcctta ttttaaaaga catgaactaa aagttgactg 

45001 ggtttcctgt cctcccccct tcaccctatc tctttctatc tcctcttctt tgctcttcca 

45061 tggccatgag tatgtgccac taataggaca ttttcttaca atgtggcaat ttccttgtat 

45121 acgtttatgt ctagcccacc attctatgag ctgctgacag cagggaggtg ttgctttttt 

45181 tgtgatttct tgaaacccaa tataattcag tgttatcaca ggccaattta ttaacattta 

45241 aaactctctt cttggccagg cgtggtggct cacacctgta atctcagccc tttaggaggc 

45301 taaggcagga ggatcacttg agcccaggag tttgaggctg cagtgagcca tgattgcagc 

45361 actacactgc agcctggata acagagcaag accctatctc aaaaataaaa taaaataata 

45421 aataaataaa agtctcatct taaatttgat ggggaaagga attttctgga tccacaactg 

45481 aattgtctat gcttggcagg cagatgtttt ctcaaatggt gaatggattc accttctttc 

45541 tcttccctag tttagctact ttcactccct tctagtcttc atatattccc aagtgaagtg 

45601 agaagatagg tctgtatatc agtgactgtc cttttctttt Wtcttttcga tttctctttt 

45661 aggagctgag atgtttcttc tctgaacttt atttttcatt gcttatggct tttttttttc 

45721 cccctgtaag ttctctgcga gctatacacc tgtggaaata tagggactct ccttcatttt 

45781 caaagagctc agaatatcgg tgagacaaag tcccttcctg cctggacatt cagattggcc 

45841 ctatggactt aaattgtcaa gcaagtatac ttttttgtgt gccccgcaac ttggcctaaa 

4 5 901 ctttgctata aatggcaaag ttactgaatt gccttggctc cataccaaat gttggaaata 

45961 attttaggat ataaaaacac atctttcata tgaaagtatt tcttttaggg tcccttgata 

46021 agcatatata tgtgttactc attttcctaa tgaattagYt cctttcttca attgtaagga 

46081 taaaccctca ttaatatcta atttgttgga atgaaaatga ttccagtaac atttttatga 

46141 cttcaatgtg actcccttat gaaaaggcat aatggaatgg ctactttaat caaatttgaa 

46201 gaaaaaagaa atattattct gaatacgcac ttcataaata ggaaaggcag ttttcagcat 

46261 ttactttagg tatcatttat tcagtgagag cttgagggta acccccaggt tcctgcaggc 

46321 aattaagacg aaagtcatga ttttggagag ctctgggttt agtaaaacca cattagatta 

46381 gagttcttag accaggaagg ggccctagaa aaccatcaga tcaagcctct tgcctttagg 

46441 caaataggat actaacccct gttgacagac aagcatattt gaaaccaata atggtgtttt 

46501 gaacaaaaga aaaatagggg cttggagcct gggttttcat cttttattgc actcttccta 
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46561 ctgtaattat agttttattt ttcaattcaa ttcattggat ccgagaggag ctcatttatc 

46621 gatttgcaac ctaagtatta gccatggagt cattcttgaa ctttgccgtg gacttgggtg 

46681 aaaaatgttt ctggttttca aggcttccct agtatcatat ggtattggaa tcaaaaatag 

4 6741 agtattgttt taaaactgtc agcataggca gaacttctca tccagcaacg cattccgtag 

4 6801 ctcaagtcct cattttgaac ataggattgc aatgctgagc ccctaccatc tttccactgg 

4 68 61 aaggaacagt tgggcagctt tttaaggtac ggcgcctcgt ggaggactcc attaatgagg 

4 6921 aatcatggtc tcctgacttt ggattgatca tttttgttca aaacaggagt attcctgagt 

4 6981 cagagtcagc cacagcttat aggcaacaat tttcaggaga caacatccca agttaactgc 

47041 ttatggcccc caaaactgta aggcttgtga tagtgcccca gtgaaatatg ccactgtggc 

47101 attcccatcc ttttagctcc tggatttctc cttttcagcc cataataggc aagggcaagt 

47161 aagggaacta acaattatat gagcacttcc tctgtcccac acatggtgct aagtccttta 

47221 aagaaaacaa ggttttcttc actggttctc tatggccatt attagagaat gctggagtca 

47281 ctgatctgga aaaacctgtt taattccttg gtctcaattt cttatctgta aaatgagatt 

47341 aaattggtaa aatatgtgaa gcacttaaaa agtatgtttt aaatggtact tagtgttacc 

47401 cacttcatat acatttctgc cagctctatc atacctaatc actcatttga tcatgtcatc 

47461 ctcttcccag tctctgttct ttgagctgaa gaatgtttgg tgattcctgg ctgcttctgg 

47521 gcaaaactcc ttagtttgtt gtgtgatctg gccccgtctt acctctttag actgatagat 

47581 cccttcattc tcgcattggt attaaaaatc aatatagacc atgattttgg gtttatggct 

47641 tctgtaagtt tgtgaggttt ctctacctaa aattccctta ccacctgtcc tcctatgtcc 

47701 cttccctcta cagtctccac ctgttcaaat ccccatgtcc cttcaggaca cgggtcccag 

47761 cctctgcccc aatgtagcct ttcctgcttg cctttaaact gggagtaata tcctcctttg 

47821 gattactagc cctgtattta tagctgtttt gaaaccctat ggaagtgtag gctttgaatc 

47881 atcaaaaagt attaagtgat gttcagtact tccatttact cagtactaaa gttcatcagt 

47 941 gtattttcaa acatttttca acaagacttg aaataaaaaa aaagtttttc cttaagaggt 

48001 tttttctttc tttctttcct tttttttttt tttttttgtc catttgacat ctgaatcctg 

48061 aattgactag acaaatttgg tttttctagt cagtggttaa ctgggacatg ccattcttca 

48121 aacatttcta ggaatcccaa tacctagtag ctgattcggt cgtgctggag aatacaaagg 

48181 cagtaatcaa agagcctaca cagagagaca ctgaatttta gaaccaggat aaatcaaagt 

48241 gactttagtg aaacgtcacc acgatctgac gtgatctgaa taaaccacaa tctcagagag 

48301 tgaggatatg ttttgagtaa atttgttctg tgtgttggcg agagtggctg cctagttaac 

48361 cgtgggcagg tctgtgagtt tgcggcatgc cccttcgttc cagctgcttg ctgataggct 

48421 ggcccaggcg ggatccatcc ttctgatcgc caggtcctga tgaggctggt gccaccacta 

48481 catccattcc agggagactc ccaRtctctg tcagtttctt ctgcttttct gaattctaaa 

48541 ctcactctga tctgatttca ttatttgtct gttgtggtag ctttggtgaa gttggaccac 

48601 aaataatgat atagaagaaa aaatgaactt tttttcttct ttcctgtgtc cttttatcag 

48661 gtatcatttc ctctataaaa ctaattttaa gttgatagag tcttaggtct atagccactg 

48721 ttgaatgcac ctaatcaggc catctccttg aactagagaa tgtttgcatc ataggataga 

48781 taccaggttc cctgagaggt gggtaccagg tgcctgggaa gtgaagtaac ttgccaagaa 

48841 cagagagcca gggagtgaca ttgcaggcat ttaagcctag gcagtctgag tcccgaaaac 

48901 aagtcaagac atttattgKc tttccatttc ttacccatgt tcttactgtt ttgcctggac 

48961 tcctctttta atctcagtga aaaaaaaaaa aaaagtacat atttaacaac tgaagaaaat 

4 9021 aaacagagac tgtagtaaat ttccaagcta atagcaagtt atgtaaaaaa tactacttgc 

49081 tgatgaggtt tgtaagaacc tcctagaacc ttatagaaca agtgtggaat acttgttttc 

49141 tgctaagggc tattgacccc aagaaaattc aatgaagcga tctttgcaaa atagtaaatg 

49201 atttttcttt gtctccctgc agaagcaaag aggcttagac ctctagccac tgttgaatgc 

49261 accaaatcag gccacctcct tgagctacac agaatgtttg cattatagga taggtgccaa 

49321 gtgccctgga aggttggtgg agagagagat gacttccagc tgagattgta cagtagaatt 

49381 taatatttaa agtttctcga tttgacgggg ctgaattagt tcatatgagt tcataagtag 

49441 gaactgcttc gcttaatttt ggtgaataaa atatcctggc tgcaaaatac aaaaaagaaa 

49501 gtcaccctca ttttcaaatg tatggttagt gctttccata gaaaacatat ggctaaatgt 

49561 gtgtgttttt ttaatttagc tctgaaatgt gaagactgta ataagatcta gtaacaagag 

49621 cgcagtttag aaaaacctga ttgactctag tttatgtaac tatacaggct atataaagaa 

49681 agtctcataa atRaacttca tctaaagagt gtggcagcta taaccttctc caactttcag 

49741 gctctgggtg ttctcctagc ttcttctgag tttatagtct ttatgtaatt attaatacca 

4 9801 tgatgatcat tctggggttt gctttttgcc ctggtcactt taggtttaat tttattccat 

49861 tcctggtcat ttaYagtttt gttctgtcat ctccccatac ccatcgagaa ccctattctc 

49921 ctgtggcaga ctttaatgaa tgtaaccgca gattatgtgt tttctttctt tggtgaactt 

49981 agaggatgca taggccaggt gccaagtgtt agaagaaacY atacctttcc tgatggtgct 

50041 gcatggctag cctaccccga ccatgcagta agtgacttcc tcaccgctga tgtggcagac 

50101 ctgctctgca tctccacagg gtttctgggt tcttttccag gctgtttcca tattgcacac 

50161 tcagactggg taggaaagca ggcagggcac agttttatat gtacgtcagc attttcactt 

50221 ggtgatggat ataagctaac ttttgtagct ctcctaggtt tattactgac atctgttccc 

50281 ccctgagcat atccagctgg ggctgaagcc acatctgcac tttaaacttc catctacctt 

50341 tatgttttca atgtaatttt agatttcatt tgtttttaat tgtgaactca gaagagatga 
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50401 gttccagcta tgactcagtg tgagttccta tgtcataatt ctacatttct ggagtgctta 

50461 cacctgacag gaataaactc aaggaaacaa aaagaacagg aaaggcctta ggcacatact 

50521 ggttgttcta gatgtggagt tctgctggca actgccacag agtgaaattg gccacatgga 

50581 atccaaagca cggcctcaca ttgttttttt tttttNgaat attttatgta agacagggat 

50641 gttaaaggta gaggaagtaa ttaagaatgg cagctatgtc tttaaaatta aacacatcgg 

50701 tcataacttt aatgaaggYg taaaagtgtt tttagtttta aacaggcaaa aaggctttta 

507 61 aatacaagta accagttttg agactttaaa aagcagaagt ttttcatgcc agtgcttcct 

50821 attttagttt caaaggaaag gaggaggagc tgaggtttgg atggttatca tagatgaggg 

50881 agttgacatg atcaaaaatg tttttttccc tgggaacaca tcttgagtgc ttatctcttc 

50941 taatagataa agggctggtg aattttgaat gtttcctgca gctctgaaga aacactgtga 

51001 tcctaatgaa caccgaggaa agcttgtatt gcagccctaa atattacctg cttcaaggag 

51061 gcagcatgtt ttggtacagt ctgatcatga ctataaatca aagcatcttt acttctccag 

51121 ggagataaaa aaaatcatgt gttactttat aaggatcttg tagttgcagt atgtctgtca 

51181 gatgtttcca tttttatgat ttaagacact tggtgctgct atgaatagca aattggaaaa 

51241 attgggcatt ttttaaattt tgaattttat cttagcatat atctggaaat gaaatagcga 

51301 tcttggaaca gagacatctt tgttagaata tgaaagaatg tcagtgaatt ctaacttgaa 

51361 gctacattga gatgacatgt taaaggcatg aatagacaaa gggaatgatt ttcaggaagt 

51421 gccttctgga gactgtggga aatcccgtca . tgggtagagg aacagcttgc gattggatca 

51481 aagtcacgca aagttacgca ggtggtagat gctggtagtc aacatggcta ggtccatggc 

51541 aatcgatcat ttgcccagca ttttgcctgc tgaatttggg ggagtgaaga ggacattttc 

51601 accaccccag gaagatagta actagtgctc tatgtatgta gctcagagcc cttgaatttt 

51661 caagtgacct tttaaaagtt tcccccagaa aggtttgaat agacttccta tctattactc 

51721 taagtttagg tctaagaaaa ttcccaaatg ggttaataca gtcgatcctc aatctttgag 

51781 aattctgtat ctgtgaaatt ggcctacttg ctaaaatgta tttgtaagcc ccaaatccat 

51841 ccgtgaggac tctttgtggt cattggtgga tatgtacaga gcagcaaaaa aatttgagta 

51901 gctcaaggta cacgtacccg gtggaggttg aacaaggtga cactctgcct tcttgcctca 

51961 gttctcatac tgtaaacgag tgtcctggat gcagacaatt cagtgccatg tcttctgcat 

52021 ttttgggctt tttcttggtg atttcactgt ttaaaatggc tccccaaata tagggctaaa 

52081 gaactgtcca gtaggacttt tttttttttt agatggagtc ttactcttgt cgcccaggct 

52141 ggagtgcagt ggcttgatct tggctcactg caacttccgc ctcccgggtt caagcaattc 

52201 tcctgcctca gcctctcaag tagctggggt tacaggcatc agccaccacg cccggctaat 

52261 ttttatattt ttagtagaga cagggtttca ccatgtcgaa caggctggac ttgaactcct 

52321 gacctcaggt gatccaccag ccttggcctc ccaaagtgct gggattacag gcgtgaacca 

52381 ccgcgcctgg ccctgtccag tgtttttaag tgcaagaagg ctgtgatgtg ccttataggg 

52441 aaaaatacat gtgttagcta agctttcttc aggcatgggc tttagtgctg ttggctgtga 

52501 gtttagtgtc aatgaaccaa caacttagat caaataagcc attttttttt taaaaaacag 

52561 aaacacacat aaaatgacgt tataagttgg ttgatgaaaa ttttgtgacc agaggcttgc 

52621 aggaacctaa ccctgtgttt cccccagaag ctaaggatta gtatttccta tggcggtgtt 

52681 caaggcaact tcatagacag aactaccatc atgaataaca aggatcactg tgtgggctga 

52741 gtgcggtggc tcatgcctgt aatcccagca ctttgggagg ccgaggcagg cagatcacct 

52801 gaggtaagga gttcaagact agcctggcca acatgatgaa accctgtctc tactaaaaat 

52861 acaaaaatta gccggtcaag gtggcgggca cccaataaaa tgacaataga gtttaaactc 

52921 catggtttct gaggcatttt ctgagtaaat tggcatacag cgtatgtact ctttcctcta 

52981 gaagttccag aaacaacact atttctttat gtgcaaaatg gcctcttttg agcagccctg 

53041 gggcagtttt gtctggccct cttgcagcca gggtgcccca gtttagtgta caattggtat 

53101 aaaaataggc aacacaggaa cttgcttgtc tcgggggaaa agacgcttgc agatttatag 

53161 aaattttaca tttgtatgca tgatattctg taggttcaag aaaaaacaat tcaatttcaa 

53221 gataacattc tacagggtaa ataaaattta atttcaataa atttaagggg aaaagttgtc 

53281 catctttcta tactttctgt aacttttgta gtttcaccac taacaacaac aaaaaaaaaa 

53341 gaacacaaac aaaatagcct tgctctgggt tttgaggaaa tggttttgca aggctatttg 

53401 gttagacaat gaattagagt cagaacttcc gggatgggct ttcggtaagg gaaatgctta 

534 61 ggctgctgca aagcctggat tcaacttaca caggatcctt gagaagttgt tcttcgcatc 

53521 cagaaccatg ggcaatgctc tatggtataa aaaccccgaa ggtaaacact gtctgatata 

53581 tattttttat aattgcaaaa tacacataaa cttaccatct tgactgtttt taagtgtgca 

53641 gttacatatt cacattgtcg cacagtctcc agagctcttt tcatcttgca aaactgaaac 

53701 tctgtaccat taaatgactc ttcatctccc tctctccagc tcagcccctg gaaaccacca 

53761 ttctgttttc catctctatg aatttgacta ccctaggtac cttatataaa tagaagcata 

53821 gagtatttgt ctttttagta ttggctgatt ttacttagca ttatataatg tccttaaggt 

53881 ccatccatac tgtagcaggt gtcagaattt ccttccttct taaggctgat taatatttaa 

53941 ttgcatgtat acaccacatt ttgtttatcc attcttctgt caatgaacat ttgggttgct 

54001 tccacaattt gactattgtg aataatgctg ccatgaacgt gggtttgcaa atatctcttt 

54061 gagaccctgt tttcaatttt tttttgttgt tgtatactca gaagtagaat tgctggatca 

54121 gacggtaact ttatttttaa tttttttgag gatctgccat aatgttttcc atggtggttc 

54181 caccatttta cattcccacc cacagtacac aggggttcca gtttctccac atccttgcca 
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54241 acatttgtta ctttcatttt ttttggtagc tgactgataa ttatgactaa ataatattgt 

54301 tgaagaacta ttacaatgtc aagaaatttt ggccatcagt gatagtctta tgattaaact 

54361 tagtagtatt ttattattaa acttagtagc atttattagt agtagtttta tttcagaaat 

54421 atttgcattt tccatgtttc tagcccctca attatgtagg tagaaacaaa taatatagaa 

54481 tcaatttacc ttatgttacc ttagaactgt ggccacagcc tagtaggtgc tcagttcatt 

54541 tttgttaaat gaatgaatca atgaccatga agacagttca ggttatttgt tatggagata 

54601 cgtaatggga ttggaaaaca tgtagggtaa atgtattagc cctctcctgt gttaactctt 

54 661 tagccctttc aaaactaaag gtatttggcc atgtgcggtg gctcacgcca ccgatcttaa 

54721 tttttttatt ttaaaaatag cttacttaaa aaatagcttt gttgtattaa aaaatagcac 

54781 ccagctaatt tttgtatttt cagtagagac agggtttcac catgttggcc aggctggtct 

54841 tgaactcctg gcctcaggtg atccacccgc ctcaggcctc ccatagtgct gggattacag 

54901 gcgtgagcca ccacacccgg ccaggctttt taacatagta acataaacat ttttattctc 

54961 acaatgtctt tatggtttgg gaatggtggc ttcctgtctt tcagagtctg gtgttatttg 

55021 tttttttggt gtgtgccttg cagacacctg cacttgaaat ctttcaggta ttttgcagtc 

55081 gttttctcag atggctggat tatttcaagc caagaataac agagttaggg tcaagactgt 

55141 gaaccgtatg gtcagtcctt ctaaggaagt atttattttt attcactttt gtttgccttt 

55201 ccttgggtca tggtcctaat tgctgttccc tttggctgca gttattcaaa actgaattta 

55261 ctgctgagcc taagacagtg tttttcaaac ttttaaaatt aagacaccca gtaagaaata 

55321 tgttttacag tttttttaca tcctgaccag atttacacac acacacacac acacacacac 

55381 acacacacac agaacctgaa gtatttgttg caagttgcag atgtttacag ttactgctaa 

55441 ggatatgtgt ttccatttaa tttacataaa atactgttca ttgaattgat tttatgactc 

55501 acattatttc Rtcatgcagt ttaacaaata ctgagcactg gtgtcacaga actttgtcgt 

55561 tcatgctctt tagcaacagt cagcttttct gctcatgact gatggcctat tgctatgatt 

55621 ctgttagcgc tttaaagcaa tttgattgtc aaagtcatgt tagctctggt tgtgtattca 

55681 gttactcagt ttcaccttta caggcagctg ctccttggga aatggggctt gctgagcagg 

55741 ttgaatgttc catagaatca gatctatact ttggggaact cagcagtatg ggaatcacaa 

55801 gccaacccac cctatccgtt aaagggctca ggcccattgg tctccattgt cacttagact 

55861 agcacaacat cacctacctc atggatgctt tgagggcagg gccggttttg tttttctttt 

55921 tattcctcag agcccgcagc atgattcctt gtgtggagaa tatgcatcag ccttggctct 

55981 ggtcctctct tcactgtccc tcttctcctc cctacagaaa acagcaaagc atgtatagaa 

56041 agagatgtgc agactgtgtg gaatgacctg aggggtaact tgcttagagg gtagagactg 

56101 ttgtgtgtaa aaacttatgt ttgaggcaaa ttgcaggaaa aagttcttta aatgaaggaa 

56161 agataagatt ataagttttt taaaaagttg tcttaatgaa tagaggagaa tgactagtca 

56221 ttatttcata gatcataggt acataggtga ttttaaaggt tgagtgactg gtcccttgag 

56281 gttagttcaa tgtcctcttg taatctgaat tttttcatca aattctttta aatccaggtg 

56341 gggctcagct ccctctgaag tatcacaaaa ccctaaatga attaggttaa taagactaat 

56401 tatatttcca tagcaatgtt taagtggcct gctgcccaat caccttccaa gccctgctct 

56461 gttgggtttc tttttaccca gtcatgatca cgtaggacca ggctgttttt cgtagggggt 

5 6521 tatggagtcg caatgcctgg ggtgttagga atataggctg ggcttgtatc atagtaccca 

56581 cgccgctgcc taactcttga tatccagtga ctgagttgga ctttgtagct gctttctctt 

56641 tattaggaga aaagagcttg tttaataaac ctaagaatta atagcctgtg ttcagtagtt 

56701 ggatttgtaa cctgaatgtt tttatgtcta ctgacttgca acgttgtcat ataaattaaa 

56761 gatcatagat ccagctatgg tttaaaggtg acttccagga catggatttg aaagatcaga 

56821 atttgaatct catctctttc cattctagct tggtaacctt gaggaaacca cttaacttct 

56881 ctaagcctca aaggtgtcaa gtagattaat gcaactatgt atagctgacc ctgaggaatt 

56941 tttggttccg ctagaataaa gcttgaattt ttggacatta gaaggatctt tgaagatgat 

57001 gtagcttcgt tgtgtaactg aagaccagag aaatttaagt gagtttcctg gaatagttta 

57061 tgcctctgct tcctgtgtca tgattaccac ccagaacacg tgcttaattg gggacaggta 

57121 atccattcag ggtgaagaac atggcttttg ggtcagtctg cagtggatct aaatcagccc 

57181 cctgctgttt actgtgtctg tagtctagaa aaagctattt aacttcttgc agatttagcc 

57241 taactatgaa attaaattgg ggatttaaaa aaactcatag ggacgtggtg aggcttgact 

57301 gagataaatt tatgcaaagc tcttagcaca atggctagta cccagaaagt gctcaacatt 

57361 attattattt cctaattgaa gggcattgat gatttaaaat aaaatggagg ctgggtacag 

57421 gctcatgtct gtaatcccta cactttggga ggccgagttg ggtggaacac ttgagcccag 

57481 gactttgaga ctagattggg caacatggta aaactccatc tctataaaaa atacacaaat 

57541 tagctgactg cggtggtgcc tgtagcccca gctactgaag aggctgaggt gggaggatgg 

57 601 attgagcccc ggaggtcagt gctgcagtga accatgattg tgccactgtg ctccagcctg 

57 661 ggcaacagag caagacccgg tcaacaaata catacatgca tacatacatg catgcataca 

57721 tacatacata catacataca tgtaaataat taaataactc atttcttgtc agataaatgg 

57781 ctgtatcttt ataataagat atctgtatcc tgtgacttca tcctgtaaat aattttgtgc 

57841 ctcttgtgct ttctatgatc tagggaggag aaagctaatt cttcttcatt ttatgcacgg 

57 901 agcagagaca cggagagcct ctaatttatt tcttcttggt gtggccctgt tttctgagca 

57961 tgggtgtgtc tgatccctgg ggagagcaga gccacactgt ggatctgagg tgctgggaag 

58021 ccatccagtt tcctcctcct gaccctgact caagtcttcc ctgaaatctc tgtcagcccc 
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58081 attctctttc tgtcagcccc attctctttc tgtcagccgc cctccttaca taacccaaat 

58141 gggttgtttg caaagctagt gtcgctgagg tgttctgtgt acaacagaaa agaatctagg 

58201 gaggattcct atgtgtcact aaagccagta attaagtgga caacaggggg agctaacact 

58261 gaatgcacca ataaatttca agactcctgc taccctaggt aaccgagtca atagtattag 

58321 aaaccatttt aagcattggt agttttaaac atgctgttta aaaacaattt taaatttacc 

58381 ttcacttttc taattggata ctttactatt cagagtacta tgagatgagg gtctcgcttt 

58441 ggagtgcagg ggactgatct tggcttactg cagcctcaat ctcctgggct caagcgatcc 

58501 tcccacctca gcctcctgag tagctgagac tataggctcg tgccaccata gctggctaat 

58561 ttttaatttt tttgtagaga tggggtctca ctatgttgcc caggctggtc ttgaactcct 

58621 ggcctcaagc aatcctcctg ccttggcctt ctaaaatgtt ggcattatag gcgtgagcca 

58681 ccatgcccgg ccaaacttct tttgaaatta gcttgttgat tcttcctcac ctcccagttg 

58741 tttttgtgcc agaattaatt tttctccttg tattatagaa tagtttggag gagtattgaa 

58801 agaattaggg ggtagagttg ccagatgtag caaataaaaa tacaagacac ccagttaaat 

58861 gtgaatttca tataaataac tacttttttt ttgctataca tatgtcccat gcaatatttg 

58921 agacatactt atactaaaag attgttaatt atctgaaatt cagatctaac tgggcatcct 

58981 gtattttatc tggcaaccct aactggggtg gggatggtga ggagggcctt ggatgtggcc 

59041 agaggagagg tgtcagagcc tcaggtgtct tcttgtgggt gaccgaggtg gctgcagcac 

59101 agtagccttg ctccctgggt ctggggcctg tgccttctcc ctgtagtcac ttagaatagg 

59161 atgatggggg tggctcaagg cagtggaata aattaattct gaaggaacac actgggccag 

59221 agtcctagaa cagtttactt aatgatagtg ttattttaat tttcaattgt ttgcctttct 

59281 tccctgtgat acggaataaa catgaaattg tatctggagc ggagcaggca gaacttacat 

59341 cttgttggtc ttgttctgcc ctcagcctct gatgttctac aaggtttatc cttttgtttc 

59401 tttttttatt atccaaggaa tgagatatgc cagaaaaatg tgacaggtat ttatgaaatg 

594 61 ctttgaacta ggtgagctta gagcataagt aattttaggt catttatctc atcacaacac 

59521 tatctacaga gttttaaccc ttatcataag gaacagacca tgatgacact gacattatca 

59581 acataacgac acacatgctt ttctattcct aatgctttgg tgagagaaac tggggcatcg 

59641 gagaatgttc tcagccatat ttttgatatg gcctaaggta taatgaacaa aagcttagat 

59701 gagaaaagtc catctgattg atgcctggct aattgacagc caattatgtc atggtgccag 

59761 ttcttaaaga aattgaccca ttaatccctt tgtgtggaga ggccagccag caggcatctg 

59821 ctttcttagc atgcagcagt actgaaaagt ttattgaaaL aaLcgtcacc tgtcctttcc 

59881 aaatcttaat tcttctgagt ttaaacatgt tttctctaag gaaagtcgga ttgacatgaa 

59941 atcacacatg tctggaatta tctctgagtc ctttataaac agaccaagac ttggaagggc 

60001 acaccttagg ttacagagtg ttttcctggg ctgggctcca aagcttcctg actattgaac 

60061 aataatgtgt tctttccatg ctactttata tatttatttt acagaagcct tgtgcctttt 

60121 atccgttcat atccactttg aaaacttact tgtggcctgc cagagcagag gtacaaaaaa 

60181 ttccggatta ttttctgttt agaggcccca cacaatacag tatatgtgct gaagatgagg 

60241 gcactcctcc tcctcctcct cctcctcctc cttcctatcc taccccttcc tccacctcat 

60301 ttttcttctt cttttttaaa aataatcata agcatgtgtt ttcttctcat gtgcttgtaa 

60361 atatttttgt tggggattct tgactgggat ttcagaatat cctgataggg agaagttggg 

60421 tatttcttca tgagataaat tccttaagga gagatttgaa tagttttgac ctttgggttt 

60481 tcttttctct ttcagttttc tttctaaaaa atgttacatt tcttggttat gagataaaaa 

60541 caaacctata atttgtgata atgggtgaaa atgtgattag aattcacatt ctaggtttaa 

60601 taatgacaga ctacttatga aagataagat gtcagagctg gaaggcttct tagatattgt 

60661 cgggttcaat attcttttct cattagagga aatggagact cagagacatg aagtgacttc 

60721 tccaaggtca cacagtaagt gagtgatgaa gctgggagta ggacctcttt tgcctgactc 

60781 caaacacagc tttccccaac tattgaggaa aaggactcag gacaatttaa catttcaagt 

60841 cattgaaata tcctttaaat gctcaaactc taattttaac cttatgtgtg tgtgtgtgtg 

60901 tgtgtgtgtg tgtgtataag tcgttctaaa gtacttaacc ttctgaaatc ttattttgac 

60961 catgtagaac acagttcgac cttttttcaa tctcatcatt atcaacactg ttttgtgaac 

61021 atgggtcatt gtggttttaa ttcatgggtg cccttgggct attctgagtc tataggactt 

61081 gcccttagtt acattaacac tcccacatga caaaactcat gagtgcatgg ggaacttttt 

61141 gatatccttc ctcattgtca gttgtcactt tttgctactt tcagaggtga tatttatagc 

61201 atttcccccc ttctgtgttg ttcctaccca caggatttta acttacagag atgactgaat 

61261 gatgacacag aggggacaag tccattgaaa taagtcttgt tttgttttgt tttgttttgt 

61321 tttttgggac agggtctcac tctgtcgccc atgctggagt acagtggcac aatctcagct 

61381 cactgtagcc tttgtctccc aggttcaagc tattctcccg cctcagcctc ctgagtagtt 

61441 ggacttacag gcgtgggcca ctacgctcgg ccaatttttg tatttttagt agagatgggg 

61501 tttcaccatg ttggccaggc tggtctcgaa ctcctgacct caagtgatcc tcctgcctcg 

61561 gctggtgtgc taggattaca ggtgtgagcc acagcacctg gcagaaagaa attttttttt 

61621 attactcaca tttcctaaga gaagagggca ttccatgcca cacagggcca ggaggagaag 

61681 cacctatttg ggtgaagagg aagagatggg agtcagggga aagccgaggc cagagccttt 

61741 actgggtttt tatggaaaag gcaaggcaga ctggaggaat cagcttgggg ttggctaggc 

61801 gctgggatgg tctttagttg tcagttcctg gccctgagag atttagggca ggggaaatgt 

61861 gggctgggta actgagagtt agataaggag gtggctcagc tagatcacag aggagatgga 
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61921 aactacttgg ctgttaactt gccctgtaat tgatggatac caaatagcca aatacagatt 

61981 cggagaaaat gcaggacaac ttcccaggct tactttgctg ccattcattt tgtggtgaac 

62041 caaaaaacca cttacacaaa cctgttggaa ggggtctctt ttgtgagtac tgtcaaagag 

62101 agctatctag aagagacctt tttttgtgag tggtgaggtt taggctgaat ttttattttg 

62161 ttttggtgct tttctgtaat ttgggattat taaaaacaaa gactagacct ttttatagat 

62221 agaaacaagg ctcttttatt tggaaatacc atgtgcataa atgatgaata aatagagtca 

62281 gtgaggacct tcttgccctc atgattcatt gttcttctcc ttctcctttt tctccttctc 

62341 cttctccttc tctttctccc cttcctcctc ttccttcttc ttcttttttt ttttttaata 

624 01 tgagacacag tctcgctctg ttaaacactt gtattagttt cattgaaagt gtaatataaa 

624 61 atctgaggat ctcagacatc ttaggaagat gactgtcatt tattatctat taaccaggtg 

62521 agcaacttcc tggcgagtga ggggtgcggc agggaagggt ggagacgcaa aggcacagtt 

62581 ccctttgtaa tggagagctt cagctcctgg gaaggatctg caatgcttag tggggctgtt 

62641 catctctaat atagttaata attactgatt tgtatgaagc agaactgagg gcagggaggg 

62701 atgtgggaag gccacaggag atatgagttt gcagccagag tttaccggtg atgaatcacc 

62761 ttctgatcaa agcaggaggc tgggacctgt gggggatgca cttcgaccct ggggtgtgac 

62821 ccaggaagtc agtgatttgt gaagggcatc ataggtcaca gatgcctttc caaagttacc 

62881 agaacttggg atccaattta tttctcttag tccaggctca ggtttgattc ccaatccctt 

62941 gcattcacac gagtcacagg gcagaaaagt gcagacgtgt tcttgtgcga cctccagatg 

63001 tggtgtggag atggaagatg gtactctcat cattagagct gatttgcttt ggaattaaag 

63061 agagacatac gcactgtgtt tctgtggccc atcatagttt ccactgatgg tgccatgtgt 

63121 catttggggt aacattgact tgtatttcca ctcagtgtga ggaaaggacc ataagacaga 

63181 attggagtaa tttctggaaa aaagaagtaa atgcttagta gagtgtcata cagtctttta 

63241 ataacaagta tttgataaac atgattttgt tatccatcct tctgcaggaa aagaagccaa 

63301 gttaattttc ctgagtttac agattggagg gtttttagta taacctgtgc ctttttcctt 

63361 caccctgttt cctcttttac tactacagta aagagggtga aatttagttg caaaaggata 

63421 ccattgaaat ttagttactt ttgctcgtct cttgctaaaa gagttaataa tgtgcagtct 

63481 ttaacttgga ctgattttgg tataatgtag tgggtttcta aaaatagatt tctttttcat 

63541 gtaattgaca attaactcca taagttactt tacagaaatt taagtttctc tagaaattac 

63601 tgcagtacac attgcatgca ttctccttaa agaaaattga cagaacaaaa tttcatcctc 

63661 tgttaggagc ttgcttttcc ctcacttgct catctcatga ggggaagcat gtattatatc 

63721 atgtaattga cctcccgagc tgtatggcac ccttgagtga accaggtaca agcagcctcc 

63781 acccaggcat tttcttaatg tccacaaagg ctcgctgact tcaagttagc tatttgtgtc 

63841 ctttaacttg ttgccacaat ttaaaaccag gtgagcattt tctgcacaga gtggtcataa 

63901 gcagtgtctg ttctgctgtg ctcggcctct ttgtcacctg ttccatattt gggcatgaag 

63961 cactaggccc atatgccttc accattttgc aatgttgttc tgggacagag ttatagggtt 

64021 tttgccttga acaaagcatc tacattcttc attcttaggg agtgacagtt ccattgccaa 

64081 tatgtggata tcagttttct tcaagcttgt gtaccactcg tatccactgc tgttcagttg 

64141 cataatctct aagattaaaa actacatttt ggtaatgctg gcaacgaggg cacaaggaaa 

64201 taaattgtct gtttttataa acatgtagct actgatattt ttttttgaag gtgaaagctt 

64261 tatttaaagg aaggtctcat agagagtttt aaaattttag aatgaaaaaa ggtcttaaat 

64 321 ttattaacca aatagtaaac taattcattc aaccaaaaga cttactgaac actccctgag 

64 381 tgaaggtgtg tgtgttagga aatgtgctta ctgggtgaac acaacagagg tgttccccac 

64441 ctcatgatac ccacagtcta gggagtgaca ggcaataatt aagtaatttt taaaaattta 

64501 tagctgtaaa gtaaagcaaa ctatggcacg tccatgaaag agaggttaga gcctatgagc 

64 561 ggggaggcgg ggaccagatg gagcctgggc tcaggggctg ccagacacac acagctgcct 

64 621 acgggcagga aggggcccca ttggaagcgt taaacaagtg tctgtataga aggaaggcat 

64 681 gtgaataaag aaaaaaagat tgacctttgt gagttgtgac atccaaggtg tccaggaact 

64741 agtttctcca ctttcttttt ctcctttttg tcatggccgg cagttgggcc cttgtactgt 

64801 tcatcagctt caccatcaaa atcaaatgaa aagaaagaga agggaatgga atgctcactt 

64861 tctagacctt actttaaatt ctgttgaaga tttattaata tttgggagag agtttgaaat 

64 921 gataatccaa aagatgtctc cctttgaaca tatgtagaag ttaatcattt agattaactt 

64 981 gcatttaaac acataattgt atgtgatcgt atacattttt tggctcactg ttttgtgttc 

65041 aaaggcagat ttcctagggt agtgctgatt tattctactt cttttaggct gtgtaattca 

65101 ccttttttct aatttgggga aaattgtggt cactatagtt atatattttt tatttaaaaa 

65161 cttgtaatat tttattaagt tatctaccca aatcttcctc ctgaaagata atatcttttt 

65221 ttcctttttt tttttttaat ttcagtaggt ttttggggaa taggtgtcct catagcttag 

65281 ctcccactta ggagtgagaa tataggatgt ttggtttttc attcctgagt tacttcactt 

65341 agaacaatgg tctccagttc catctaggtt gctgtgaatg ccattatttc attcctttat 

65401 atagctgagt agtattccat tatctatctc tgtatatgta tatatcacaa cttctttatc 

654 61 cacttgttga ctgataggca tttgtgctgg ttccatattt ttgcagttgt gaattgtgca 

65521 gctataaaca tgYgataaca cagtatcttt tcagatatgt gtccgtctga catgttctgc 

65581 accacccact acactgcagg gcatttaggt aagcagcatt tataagagtg actggaaatg 

65641 gcttaggaaa aatggtacag aatgtaattt atcagaataa tcactgatcc tcttagcatg 

65701 tgtattcaac aggtaactgt gactctaaaa tgttatttgg gagggaattt gcaaccaaag 
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65761 


ttagaaacag 


ctctagaggt 


gctgggatct 


aggggcctta 


gattttttca 


tctcagttaa 


65821 




ctttaggaag 


ctttaggata 


agattctggt 


gctagtgtcc 


ccggttggtt 


65881 


ttgcttctga 


gaaccaattt 


tgcagttgtc 


acatgatgag 


ttggatcctc 


atgcagatcc 


65941 


tccaagggtc 


ttactgaaca 


ccatcggttt 


atgctagtga 


gggtttgtgg 


ttctcagctt 


66001 


cccgcattgg 


aatgaacatt 


atttgggcct 


gagctgtgac 


agcttccttc 


t t cage t tec 


66061 


ttccttgggt 


tcttctggtt 


ggggtatgga 


ccacttcttc 


tggagaacag 




66121 


ttcaagcacc 


atcgtactct 


cctagtctag 


agctcagcca 


cccagtacta 


taaccaataa 


66181 


agatatttgg 


ctcaaaaaca 


caligagagat 


ggtagtgtga 


cagaat:33t:g 


qaattqt33a 


66241 


ttgtatttag 


ttttaatcaa 


aactttaaaa 


ttgatatcca 


atatatatga 


ttao3aca3c 


66301 


ttgaatatgt 


gagcttacct 


tttcatctct 


aaagt't'ta'tg 


ataa3ag3tt 


tccagta33a 


66361 


atttaatgtc 


caagttgaga 


tatgctgtga 


gtgt 33331:3 


catgctat3t 


ttc33agact 


66421 


tagtatttaa 


aaagaataat 


atacaataag 


taattttaaa 


atgttgatta 




66481 


tgataatatt 




taggttaaat 






tqttatcttc 


66541 


ttaaaattgt 


gatgactata 




ttgcataggt 


ggcttacata 


atgtttctgt 


66601 


tattgggcat 


tgccctagag 


ctaactgagg 


aaaagatcat 


agggcaccat 


ttgccattgt 


66661 


tgtcagtatg 


tggctttcag 










66721 


taaatgccac 




aaacaacatg 


gcaaatcttg 


cccttaaggg 


gacctcattt 


66781 


tttccccagc 


aaaacaaaac 










66841 




aaatgaactt 


tattggttaa 


taqqtqcaqc 


aaaccaccat 


qqcacacatt 


66901 


tacctatgtg 






catgtatccc 


aqaacttaaa 


attaaatata 


66961 


tatataaaat 


aaactttatt 


agttatgcaa 


tga33t333a 


cagagc33gc 


a3taccttta 


67021 


ggttcactga 


attttaaagt 




aatacaaatg 


tatatatttt 


gtatacaogg 


67081 


ttatgaaggt 


atgaacatat 






aat3cttgtt 


cctcttttgt 


67141 


agttttaaat 




aagaagtaac 


ttgacatgac 






67201 


ctattctgtc 


gtgggctcca 


aatgtaaatt 


C3l:c3g33g3 


gctcacagct 


ttgttgaatc 


67261 


tcctgcctct 


ttggggagat 










67321 


gaggtcagtt 


agaccactcc 


aaggaccact 


ctottttctt 


ctttgtattt 


tctggggasa 


67381 


acgtatagct 


agcatgctgg 


gactgtttgt 


tttgaatgag 




agttttcaaa 


67441 


acaaaattat 


gaaaaaaagg 






catgtcttgc 


tggctaagca 


67501 


gatgatctct 




tacaatatca 


ttttaaaaqt 


aacaaaqgta 




67561 


ttatctttaa 


qqctqcatat 


tttattqttt 


atacattqqc 


caattaqqaa 


ctqaaattta 


67621 


agaaagacag 




ctattgaagt 


cacagaaLga 


aat at aa 




67681 


tgttcccagt 


ttagcatata 


tgtatatata 


tgcattgatg 


aataatatga 




67741 




aactgcagaa 


acctgtaa t: t 






carcqtaaca 


67801 


aatattacca 


ggtttttaat 




aaaatgcatt 


tctagggctg 


qqcgtqqtqq 


67861 


ctcacgcctg 


taatcccagc 


actttgggag 


gccaaggcag 


gaggatca eg 


aqqtcaqqaq 


67921 


aticgagacca 










caaaaattaq 


67981 


ccgagcgtgg 


tggcacgtgc 


ctgtagtccc 


agctqctcgq 


qaqqctqaqg 


caggagaatt 


68041 


gcttigaacca 


gggag'tcaga 


ggttgcaatg 


agccaaqatt 


gcggctgcac 


tcctqcctqa 


68101 








aaaagaaaa3 


aagattttct 


aaasttatst 


68161 


ttatactctc 


tgctccttcc 


coat cage ca 


tcaacgcttc 




tctcccttcc 


68221 


ttaatgataa 


gccctcagct 


ttgctcagga 


ctcagcctcc 


catgtggttt 


tqqtaaqtqq 


68281 


tctaagacct 


gaggcccaaa 


gcgtiga'ttgg 


ctgatgctgt 


gatttotcag 




68341 


cattagaatc 


accctgggag 


ctttagaatc 


cagatgcctg 


ggctctactc 


acaqaqattc 


68401 


tgatttaatt 


ggtgtgttgc 


agaacctgac 


ttgagccatt 




tcattgatag 


68461 


cttgtagctg 


taacgattca 


aacatataca 


acataagcag 


ggtgaccaca 


gaagttattc 


68521 


tctaagtctg 


gtatgttcct 


aaatgtcctc 


taaattctta 


tcttccactc 


cagggctt tt 


68581 


gaagtggcct 


gatccaaaca 










68641 


ctcatcagca 




ttccttaggt 


gagccaggtt 


gcttttaatc 


tcttggagct 


68701 




ataggccttt 


tttttttctt 


ttttgtttta 






68761 


tacaacatga 


tatttgatat 


acatattcct 


agtgaagtaa 


ttactacaat 


taaattaaca 


68821 




ttcacatagt 


tacctttctt 


tttttgtggc 


gagagtacct 


aaaatctact 


68881 


atcatagcaa 


atttgtaata 


tataatgcaa 


tattattaac 


gacaatactt 


gtgtggtact 


68941 




tcatccaaga 


taactgcatc 


tttgtatgct 


ttgatctata 


tctccccatt 


69001 


ttccctgccc 


ctcccgatta 


tgccttttga 


ggoagtatag 


tgtgac3gtt 


agccatcact 


69061 


gaattctggg 


ggccttttgt 


ctattccagg 


g3g3tg333t 


tgtt3g3g33 


agattaggag 


69121 


aaagaatatg 


aaaaggactt 


agagaggata 


caaaatcatg 


aagtcacaaa 


tacctgccaa 


69181 


acaaatccac 


ataaggagaa 


aataaagaaa 


tgtcagattc 


ataaaagatc 


aaaaaaccca 


69241 


aactgttctt 


agagggaaaa 


gcatgcacag 


ttaaggaaac 




aagttttaat 


69301 


aaaatctgat 


ctctagtggt 


aaatactgtt 


tcccaattac 


acctagactt 


ttatcgtgaa 


69361 


tcaggttttc 


tagttgacat 


tgttgttttt 


catttgaaaa 


tgaatgctgt 


gtattctoct 


69421 


tgtttcaatt 


cccttatgta 


tttttgtgtt 


aactccctct 


gcoggggagc 


gooagactca 


69481 


gatgaaaggt 


attatgacat 


togagatgaa 


taatgacgta 


agaaggaott 


actctattgt 


69541 


atatcacggt 


gcagtatcaa 


aacattttgt 


ccoatgagga 


gaggcagtga 


gtcagaasat 
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69601 cctgttggaa ttgtattata atgtagaaaa ccatttcaga attactgtct gacatttggg 

69661 cagctgggac tttgagctca ttccacggcc accccacctt agacatttta ttaggaagat 

69721 gctattcttt tttagggcta tcactggatc tttgtcctca ctgatgaatt aattaggtag 

69781 gtaataatga atctgagtta ttactgctga caatttagtc ttattcttaa aaacgttcat 

69841 ttcatggtgt gttactctgt tctgtgttgc aataaaggaa tacctgagac tgggtaattt 

69 901 ataaagaaaa gaggttttat tggctcatgg ttctgcaggc tctgcaagca tggcaccagc 

69 961 atctgctccg ctacctgtga ggcctaggaa ccttgtaatc aaggcagaag gctaaggggg 

70021 agcaggcaag acacacggcg aaagagggag caagagagag caagggagga ggggccttgc 

70081 tcttttaacc aaccagctct tgtgtgaact cagaatagga actcacttgc tatggcaagg 

70141 acaggaccaa gccattcatg agtgatccgt ccctgtgacc caaacgcctc ccactaggcc 

70201 tcacctccaa cattggaggt cacatttcaa catgagattt ggagggggaa aaaacctcca 

70261 aacctcatca catggttaag tggcataaga tgcaactcta ggaatttttg cttttttaaa 

70321 acccagcttc gagctccctt ccattccttc atcaagatat tcatgatgga tctgcattgg 

70381 gacctgcttt tgtcttattg tttcccaaga ctaaaaaggg ttttcagcgc aagtgttctc 

70441 gtaatgtttt tgtccccaag attggatgtt tttcagttgt attatctccc catatttctg 

70501 acatgtacag acgtgcccag ctatgataKt ctgtgtggat ggtcttaaaa atgagttatc 

70561 attatgaata gtttgatgga aagatttccc aggaagcatt atgctttgag tgttggggaa 

70621 gagaagccac cagatgtcgt gtaggccctt gtccctttac tgcctttcct ctctgtttgt 

70681 acccctgtac aaggcagctc ttgtggttac gatttgttaa caatttgagt agctccctta 

70741 tattttgctt ctagagtaat taaacattgt ttcttagaaa tgtaaacatg ggacttagaa 

70801 ttacgatgca gtaatgcatt tggcagctga actgtgtact gggaaaagaa aaatatcctg 

70861 gagcgggttt aagtttcgca gataagaaaa atggctttat ttagtgaatt ggaacaatac 

70921 aagacctgtg ccctgaagtc agtactttct cttcccagat gagtttcccc aggcacaaga 

7 0981 ccaacttcta aaataaaact tgaaagtaaa tggcaaaaac cagaaacaga gccctgtaaa 

71041 ctagactttt atgctacggc gccataaaaa taacttattt gctatcaaaa taagcttaaa 

71101 cagaaccttt gtctgacaga tgacttttcc tctcccctta acctgacagc acccccaacc 

71161 cagtctttgg tcataagcct gacagcacgg tgagatacaa gttcctagtg gcagtggagg 

71221 ctgatgttta ttatttagag agtttctgaa aatgaaagtg taatgtcttg agtcacttac 

71281 ttccaaattc ttcccagaga actttaaata gtgttgtatt aaaaaacagg agtaaacaat 

71341 cattcgtaaa acacacccca gaataaatct cttatttctg catgaaggca actgatctga 

71401 aacatttttt ctatgtgcct tttagagacg gcaccaaatt tcatgcgcgt tcgtgtgaag 

71461 agaccaccaa acaggctttg tgtgagcaac atggctgttt atttcacctg ggtgctggtg 

71521 ggctgagtcc gaaaagagag tcagcaaagg gtggtggatt atcattagtt cttaLaggtt 

71581 ttgggatagg cggtgaagtt aagagcaatg ttttgcgggc aggagtggat ctcacaaagt 

71641 acattctcaa gggtgaggag aattacaaag aaccttctta agggtggggg agattacaaa 

71701 gtacattgat cagttagggt ggggcaggaa caaatcacaa tggtggaatg tcatcagtta 

71761 aggctatttt tacLtctttt gtggatcttc agttacttca ggccatctgg atgtatacgt 

71821 gcaagtcaca ggggatgcga tggcttggct tgggctcaga ggcctgacaa taacacagtg 

71881 ctcaaaactg tcagatagcc tgtgtcaggt ctgaagatgt gattttggtt gtttatactt 

71941 ggatgctttg gatgggaata gccctggagt cagcccttca tttcaggcag aggagcagag 

72001 gagtgtgagt tagacacaac tttggatcag gggccaggaa aacttggttg tcatcctgga 

72061 ccagttacta actcgctgtg tgggcaagtc acttcatttg tctaagtctc tctttttcct 

72121 tttgtaaaat aaggatgtcg gcaaatctgt gtggtccctt ccaacagtgt tttttaagtt 

72181 ggtgcctgag tatctgaagc aggagatacg aagggtcatg tgagctgcac attcctattt 

72241 gctccgcagg gaggctggct aagacacccg gcccctgcct tgtaaccatg aacaaggttc 

72301 tcgcaggggc tcgtctcaac tgcaggtccc ctgaaggtct catccttttc tttgctagag 

72361 ggaatttgga tgtcgttggt ccttgcctaa cccttgtctt tgaaagatac agatccaatc 

72421 tctgtgtagc agttaagtga tctgactcag acatatttac tcagtcttct tagagaatga 

72481 gaaaactctt ctcagaattt ttaagaatgt tcctgaagga caataaaagc tctcattcag 

72541 gataggcccc aaaacatttt tttctttata atgtggtgcc atttcctcat tttgcttttg 

72 601 ttcatttgtt tattccttca acaaatattt ttgagaattt gctgagcact aggtattact 

72 661 agatactagg acagtgagat aagtaagata cagcccctat cttcaataag ctgtatgccc 

72721 tgataatgat acccttagtg tcttctacaa gctatacggt catgcatcac ttaacgacag 

72781 ggacacattc tgagaaatgc atccttaggc cattgcattg ttgtgtgaac atcatcgagt 

72841 gacttacaga aacctagatg gcgtggccta ctgcacacat agggtatatg gtgtaaccta 

72901 ttgctcctag tctacaaacc cgcatggcat gttaccatac tgcatactgt aggcaatttt 

72961 cctctccccc taacctgaca gcacccatta cctgacaaaa atggtaagta tttgtgtatc 

73021 taagcatatc tagacataga aaaggtacag taaaactaag gcataaaaag tgaaaatggt 

73081 acacctttac agggcagatc catttacgca accaccactg catgcgtggt ccattgttaa 

73141 tggaaactgt tgtgtggggc ataactgtat ataaaagtat agctacttta attttaagtt 

73201 aaccctggta tgtgggaaaa tttgtcttct gtgctgtctt gcactgaatt ttgcattggt 

73261 atttttccct taatagtggc tgcaaaaaaa cttataaata cagaaccttc ttcatttata 

73321 gaattcttct gcattgacct ggaaaatgac gttgagaatt ggacgttaca ctacaatgta 

73381 gtcctccaat gaaggctctc aatgggcatt tctttaaggc ctaagttaaa gataaaatag 
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73441 aacaacttcc atcactacaa aagatagtgt actcggagga acttgtagag attttttttt 

73501 tcttgtagct gtttttctca ctactcaggt ttcctttttg agttttgccc ctggaggctc 

73561 agagttgaat tctgttggta gtcacttaga acctttctac tgctctgtct ttcctcagtt 

73621 gtgttttccc catgtgggtt tgttttggga aagcagtgga ggggaattcc tcttaggttg 

73681 aaataacttt tagagcgatg gtgccacagt ttacaaatat ttttagaaaa atcctgtcag 

73741 attcttggga acttcagact aacttcacat ctaaagttct cttttctttt cttttctttt 

73801 ttccttcctt ccttccttcc ttccttcctt ccttccttcc ttccttcctt ttttttggtt 

73861 tttgagacag agtcttgctc tgtcacccag gctggagtgc agtggcacga tctcagctca 

73921 ctgcaacctc tgcctcccag gttcaagtga ttctcatgct tcagcctccc tgagtagctg 

73981 ggactacagg taccctccac tacgcctggc taatttttgt atttttagtc gagatggggt 

74041 ttcgccacat tggccaggct ggtctcggac tcttggcctc aagtgatccg cctgcttggc 

74101 ctcccaaagt gctgggacta caggcgtgac ccaccacgcc tggcctaaag ttcttattta 

74161 aaaatttttc ttctgatttg ttagtttaag aaggtaggtt tgaagcagtg accaggaatt 

74221 ttcgggaaat ccattaagga ataaattatt cagtaaaaca gtctcaaagt gagggccaga 

74281 gtgcaggaca gaggcagaga gagatggtag cagtttataa agagaagata cttgattaga 

74341 gaaatcattg tcagagtaac cttatgctta gaaagaaatc acacgcgaag ctctgtgttt 

74401 gaaatcagaa gggaaggtgg gcattcggat gaagaggctg tcggacttgc attactttga 

74451 ccactactgt tgtttttgct gttgtggtgg ttgttgttgt ttggaagatg gagtaaattg 

74521 caagcctggt ggatttcatg tgtttagaat tgtagctaaa atagctcctt acgttgaagc 

74581 atttcctcaa ttctataccc acgttctcag tccttgtgtt actaccagta actcattttc 

74 641 caaaatgcag aattgcattt tacattttag ttctttcaat atttggatca aatacatgtt 

74701 cagtggaaag ggtatgttta atttcctttt ggttcggcac taatttaaat tatgaataga 

74761 gataacgtaa ttctagtatt cgtatttgat tgttaaaata tttggactca gaagtgcatt 

7 4821 tacacgtttc caaatttgac aagtaggaaa aggtatagag tccaaactcc ttcccaggcc 

74881 tatccacagc tacccagtgg cctatcccca ggcaaccagt gttaccagtt tcttgcatat 

74 941 ccttctgcat actaccaata caaaagcatg ttattctgta gccccagcta cttgtgaggc 

75001 tgaggtggta ggatcactcg agccgaggag ctggaggtag cagtcagctg agattgtgcc 

75061 actgaactcc agcctaggtg acagtaagaa cctgtctcaa aaaaaaaaaa aagcatataa 

75121 ttttgttctt cttgttttta tgctgatgag gacatgctat gtccactgtt ccatggtatg 

75181 tgcctttata ttcagttatt cactaaaaaa atgagaagta tagagtaaaa tgaaagtctg 

75241 tagtattctc cactagaaac aggtgtgggg ttcaagggag agtgtgtatt aattattgca 

75301 aaacttctgg ttaggtcatt aatcagaacg ggagccatgc agtggtgaag ggccctggct 

75361 atgatctgaa acatttttct gaagcagtgt tgagttctgt aactgaagtc catggggact 

75421 tgctdLtggc ataactatat gtggaaaatt ttgctttttg agtaggacat tcccatgctt 

75481 cacatttaat ttgaattttg gtatgactgt actaagttaa catagctttt cgcattttac 

75541 agagacattg agtaaatcat tgactaaaga taattaactc ctttatatca Yatgctgaaa 

75601 tgttatgggg gacatatcat tacctttctt ggcttatgga agagatgtaa tctaattcat 

75661 tgagtctgac tgtgtttaag ctattgtttt gggtgtcatc gttggctctt agaacaggga 

75721 ctgggcctta gtccttttcc ttgctcatac ctggcctcct cctagaagac cctgcttttc 

75781 atgttttatg cctcagaacc aagatgtttg gggtcccaaa gtagggatgt gtatgagcac 

75841 attttttgat actttcatct ttctcttact ctctaacatg cgttccgcta gtgtcatgta 

75901 aatacagtga atcagatatt tctctgcctg caaaatgcct tttgccagta tgtgggaggt 

75961 gtgttgcagg taccacagtg gattttctgc atctgttcat acatccatct ttcccactta 

76021 gctgtgagtg gcctcagtgc agaatggtgc cttaactcta gagtctggta ggaaatgcca 

76081 tgcacatttg tcaaatgaat ggaaatgcag ctgacttgct agagctcagt tctgagtgca 

76141 gggtgcttga atgctatata atcaagttga gatcttagtc tggggcagca gaaaacaaga 

76201 aaaggttact gagcaggaga gtgacaggct ttcatttgta tcttaatggg ttaattgcag 

76261 tggcattaga aaaggggaga gattggggta aggcattagt tggaaagctt tgataaagtc 

76321 taggtgagtc aggaaggggt tctgcattgc ttgtttccca ttgctctaac tagatttctt 

76381 agaaaaaaga aaatcttaca ccagactttt acagcgtttg caactgagta aatttcactt 

76441 tgtaataccc tggatattat acaattatat aaagcgcagt ataataagag agttagtagc 

76501 ttcattgtaa tcagtaaaaa taatttttaa gacaattgtt attttttgta tattgtatta 

76561 taacaaaaaa gaagtcatca agtgggagga ggtgttgttt tgagacgttt gctttcatct 

7 6621 cttaatcctg cagacattta tctagtccct tctctgtact cttaaatgct agggatttaa 

76681 aagacggata agatacaggc ttattatgtc tgtgttccag acactggact ccaaacataa 

76741 agcaaacatt acattattct tccttatgta atagaaatgt ttatgtaaga ttgtgtgtaa 

7 6801 atcagtcttg aataaactga attaaatgga atgtgccaat aggagttgct atttagagaa 

7 6861 gccctgtgat aaaacatttg tataataata tacttatcta aacaagccca ctaatttcta 

76921 tgggtttttt ttttttttcc ccccaaggcg aggtatcctg tatattctgt gttgattgaa 

7 6981 ttccagttgg ccaaccacct gcctggttaa tagagtatac cattaactta gtgacactag 

77 041 aaccctgcaa aggaaataag acacaatcta tcgttgttca gtctccacta ttaaatagat 

77101 tttcattcac ttcagcttgg gtgggtgtaa tttgcatctt tcctaacagg caagcatctg 

77161 caagtgtatt gcctgtagaa gctcattaaa aatcagtgca aatcctgaca ctgtctctag 

77221 aagcaggcat taacttgcag aaagtggttt gtatttccag tgtcaKtaga gcttcctcac 
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77281 tcttttcctg cttccatgca agtttagtcc taaactagta ccactatttt agtacaacta 

77341 gtaccactca aataatgctg ctttttaaat aaattcaagg ggaactgcta aggaactgag 

77401 aacctgtaag gtgacaggaa aaaggaaatt ctattttttg gggctagttt gtgtattgaa 

77461 aataattttt gctgagaatc aagctaagaa aattacttgc taatttaaac aacatgacag 

77521 tcctcagaat tttccagcaa cagttaggag cactgtgata aagttggctt ttctgttgag 

77581 aacgttttac ctttttgctt cagcttcttt aaagagtttg aaattagtaa tttcagtaga 

77641 gcagctttgc tgttgcgtcg tactgctcag agcttagtga gctgaagcct tttgggaaaa 

77701 tagcatttgg ggagagactc gtggtgtaat agctcatccc actggcacat gtcccagagt 

777 61 aagctgggct ggaagctttt agtgtagtta aaagatgcca gtctgtcatt tgcatgcact 

77821 gtaattgggc aagtggtttc aggctgagct ttacattatc cttccactga gagcagctgg 

77881 tggtgggctg tagattccat atgagctggg gacttatcat ctggtgtgtt tagtgcaatc 

77 941 ctgcctcatc ttgggagcaa ttttttattg aatgataata ataatgcaca atcttggtga 

78001 aagataatgc ttgtggttat tagaatgtgt agactgagta ggggctttgg catagagatg 

78061 gtaatggtgg ggaaagacat atttaataaa aggattgtaa tggggagaaa gtaaatattt 

78121 tgcaggataa ggaaagccac aaatatgatt aatttaagag tcttaaataa aatgtctata 

78181 aaatgttaga tttttagtac cagtaaggca aagtggccaa tctctagctt cctttataaa 

78241 gtctactcat ccttggaggg tcgctttttg gtgctttttt tgtttgttca gagtcaactt 

78301 cttcctctag ggatatgttt aagtctgtca cctttcctta ggaattgtgc caatctgatc 

78361 atttgttcca ccactgctgc cccgccccct tttcttgact caaagaataa tttgtgtacc 

78421 ctgtctctgt tttttttttt tttctttttc tttttttttt attatacttt aagttctagg 

78481 gtacatgtgt acaacgtaca ggtttgttac atatgtatac atgtgccatg ttggtgtgct 

78541 gcacccatta actcgtcatc tgcattaggt atgtctccta atgctatccc tcccccctcc 

78601 ccatctctgt tttttattat ttatttattt atcgttagag acagagtctc actctgtcac 

78661 ccaggctgga gtacagtggt gcagttatag ctcactgcag cctcaaactc ctgggctcaa 

78721 tcgatcctcc cgcctcagcc tcctgagcag ctagaagtgc aggcacatgc caccacaccc 

78781 agacaatttt tctatttttt atagagatga ggtctctcta tgtttcccag gctggtcttg 

78841 aactcctggc cttaaatgat cctcctgcct cagcctccca aagtgctggg attataggtg 

78901 tgagccacta cacccaggcc ctgtttttaa cttgcaatac cttttctgca agattgaatt 

78961 tatattagat tataaagttt gtggacaaaa tagaacactc cattataaaa gcctccttca 

79021 tttgttttgt tctctggtgt ttgatttgac tgatgtggat ttgagtatgg aagtgttcca 

79081 tgccttatta aggaaagcac tttgggaatt ggccagggcc cacttaactt agtttcagaa 

79141 tggcagcact ttggcagtcc tcagtttctc ttatttccta acccatccct actcattaag 

79201 acggggctat tgcattccat tttcagggaa tgctctttca tttttcgttg gtgagagaac 

79261 atgaatgcct cttaaaatgg tggtttgagc ttgctgagaa ttttagggga tccacagagt 

79321 tgaaaagtct tacaggctat cagtgaagca tggagggttt tcgttatgaa aatgtcctga 

79381 gatgggggga agactggaca gatgaggtag gggagcctcc ttgcaaagtt agaattcagc 

79441 tgtttatact ggtaacagaa tctgctttag taaggatgaa gcaaaaagaa aaacgatatt 

79501 aacaccttga gaaaatctct gtattgtgag cttaatccaa caactccaac gatgttagct 

7 9561 actttttcaa aatacatctt agcccttgtt acaataacat ttacttgtgg agtgaaattt 

79621 tgtattaggc agattcactg taatcagtaa tcttccttcc attgagattc ttcctgttac 

79681 tttcttattt aaaaaccttc agtagcctgc atcctcaggg ttctgtttac agagaccctt 

79741 aacagtctat tcagccttac ttctgccatt atccctactc taggctgagt gttaaccata 

7 9801 cctacctatg tacatttgca gctgtgccat tttatctgtt tggattctta accccctctc 

79861 ttgcccaagt gcctagtggc gccctgttgg cacgttgagg ggtagcttaa gtgtttactt 

79921 ctttcccgac tccctcagtc cgtgttagca gtaccctctt ccatactctt ctattatctt 

79981 ctgcatactt ctgtactcca cctatctgtt gccatgaaac aggtcacccc aaaacagtgg 

80041 cttagaacaa tactatggac tgcgtttgtg tccccccacc ccaactcata tgttgaaatc 

80101 ctggccccca gggcgatggt attaggagat ggggcctttg ggaggtgatt aggtcgtgag 

80161 ggtggagccc acatgaatgg gatgtcttta taaaagagac cccagagagc ttctttcctc 

80221 ttctaccata tgaggacact gtgagaaaac atctgtgaag cagaaagtgg ggtcctcacc 

80281 agacacataa tctgcttgct ccttgagccc ggactccttg agcctggact cctcagcctc 

80341 cagaactatg agaaataaat gtgtgggKtt tttttttttt tttttttttg gtcttgtttg 

80401 tttgtttgag acagggtctc tctctgttgc ctgggctgga gcaaagtggc atgatctcag 

804 61 gtaactgcaa cctctgcctc ctgggctcag gtgattcccc cacctcagcc tcccaggtag 

80521 ctgggaccac gggcctgtgc caccacgccc agcttatttt tgtatttttt gtagagaggg 

80581 gtctccccat gttgcccagg ttagtctcaa actcctggaa tcaagtgatc tgcccacctc 

80641 agcctcccaa attgctggga ttacaggcgt gagccaccga acccagccag atgtttgttg 

80701 tttaagccac tcagactatg gtatttttgt atcttccatt gtacttaatc tatgggtctt 

80761 atattcagca gtattctatg ttatgtaagg agttgtggcg tgaaaccact tccttcatgt 

80821 ttttatgttt ctttttaatt aaattttaag cctgggaatc ttggtaatga catattatat 

80881 gcaaaatatg taatattaga cttgattttc atcagatcag ttttagcact tttcagtgtg 

80941 caaaggtggg tggtaatggc tttttccaaa gcatagtccc ttggttgtat tacactatta 

81001 atgactatgg gtagtgtcag accccgagcc caagccaagc catcacatcc cctgtgactt 

81061 aaacgtatac atccagatgg cctgaagtaa ctgaagatcc acaagagaag taaaaatagc 
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81121 cttaactgat gacattccac cattgtgatt tgtttctgcc ccaccctaac tgatcaatgt 

81181 actttgtaat ctcccccacc cttaagaagg tactttgtaa ttctccttac ccttgagaat 

81241 gtactttgtg agatccactc ctgcccgcaa aacattgctc ttaacttcac cgcctatccc 

81301 aaaacctata agaactaatg ataatccacc accctttgct gactctcttt tcggactcag 

81361 cccacctgca cccaggtgaa ataaacagcc atgttgctca cacaaagcct gtttggtggt 

81421 ctcctcacat ggacgtgcat gaaaggtagc ataatacaat ttttgctgaa gttttgttgc 

81481 tcttctactt ttaaattaag gaccagatag gaaaggacat atggtaatct atcaaatata 

81541 tagccattga tattttcttt gttttctttc actaaccggt ttatgcaaga tttttttaaa 

81601 aaaacagcaa aatgaaaagg ttcgtaatat cctgacttcc tgttttatta taaatgagat 

81661 gaacaccagg tagtgttaat ttcctttaat ttctttatga gtcatcctca aattaatgtt 

81721 tgaggaatgg ttgggtgttt tcaaatatct ccgacatctt ttagttagac agcatagtga 

81781 aatggaaaga aattttgaat ctgctagact caagtttgca ttttggctct gtcatttact 

81841 catagatgta ctacttggag taatcatact gattttgtag accaattgga aagataattg 

81901 accagtgttt taggqtatat gtatatacac aaaatttgac agacatgtaa tcacacaatg 

81961 cttaagcccc atatgttatt aagcccacac tttggtaaat aacatttggg gcttaagcat 

82021 ctgtgtgatt ctatatatgc cggtatttgt aaatgtaaaa gtgataccag ctaaatcacc 

82081 catctttctg ggaactaagt acaaaggaat ggttctgttt ctaacccttt ggcttagagt 

82141 atcatagagt tttaaatttg gccagatttt aatttgacag ataaaaaaat gaggcacaga 

82201 taaaaaaatc gagaagttga atgaattatg aaaactcact agaggcagag ttgaggacgt 

82261 tgtccagtgc tctgttgtat cattatgtca ctgcctgccc tactgttttc ctgctctttc 

82321 cctcttcttc ttcctccctc tcagccctgg tccacctacc ttagtgcatg cacacacgta 

82381 tggcaaagat caacaaggat gctgggttca gggtcctttt cccactggag tctggcaaca 

82441 ctgttcctgc cttgatatgg ttcttggttg ttggYcactt aaaacaatgg catattttga 

82501 agggctgaat agacttcctt ctttttaagc ttttcttttt atttatactt tttggatccg 

82561 tgtaatttct ctgagtcatc cacattattt tctacttttt ctcagttttc atgacctgta 

82 621 ttaccattac aggtgtctct gctataattc aatatacata ttcctgaaaa cctcatgttc 

82681 tacaaaatca tacattcaga ataatctggc taatgggaaa tattggtttg gggcagtctg 

82741 gcttatggga aatattggtt tggggcagtc tggcttatgg gaaatattaa gttgggacca 

82801 accactctaa atctatgcta ctttgcaagc acagcactaa caaaaacaat gaaaaccatc 

82861 ataaaacagg agcacagttc agaagacata ctacattcct actatataca gatacacttt 

82921 ggtaaatatg actttaactc atgaaaatat gcggggctgc ttgatggaag gggatgtaag 

82981 gaaggatatg aggctggtga gtactgggag acaagaatqa aacacaccaa gacgtttgca 

83041 tgagatcatg caaagagaat catgcagaag gtacatctaa gacacaaggc caaccgggca 

83101 tggtggctca tgcctgtaat cccagcactt tgggaggctg aggtgggcag gtcacaaggt 

83161 caggagttcg agacctgacc tttaacagtc tattcagcct ggcaactatg gtgaaatccc 

83221 gtctccacta aaagtacaaa aaattagctg ggcgtggtgg cacgcaccag tagLcccagc 

83281 tactcaggag gctgaggcag gagaatcact tgaacttggg aggcagaggt tgcagtgagc 

83341 cgagatcaca ccactgcaat ccagcctggg cgacagagcg attctctgtc tcagggaaga 

83401 aaaaaaaaaa aaaaacacaa ggcccggcag gctgagacca tgacaggaac ggttagagtg 

834 61 gaccctggtg cagtgtgggg tgctctgttc agcctcacta tgaatttcac atccagcttc 

83521 tgttacttgg agatataaaa cagtaatgtg tggagacaaa tcgtggatga accaactcct 

83581 gagttatggc gtcatcaatc tgctatgacc agtcaaattc gcagtataga aacatgtctt 

83641 gtagctggac agaaaacatc cagctctact gctgctaaac atcatggaat gtacccattg 

83701 gtttatccat atagtaagtt ttaacattat ttttaatata ttgttcattc tgaatttgac 

83761 agtggtctgc caagtctccc tgaactccac aaattaaata aaggtatctc agagggcctt 

83821 tcccaagaac tttaatgcca tcttcctagc aagctataga aaacatttga aaaccccaag 

83881 ggcaaagttc cagggctgcc taggggcaaa atcagtgaac agagtagtaa actgataaaa 

83941 ccagcataga ggcctctgtg ggggaaaaag acacctttct gcagttaata aacagtacca 

84001 aagaacactg attgtcttct cagggtttgt agcccatttc tgttttttat ttttaaatct 

84061 cacctgttat ttttgttttt agctgttcca tagtagtctg atggaaatac actttatgtg 

84121 ttcttgtgtg ccaatcaaaa ataaaataag cacaatgaat acctctaatt gcatagttca 

84181 tgaaaggctt gaaaagatgc agagcagctg gctaatgctg ccaatgagcc attggctgga 

84241 gccccttgta actcagcctt tgattctgtc ttcagcaagc cccagcatcc tcagggccca 

84301 tgtgatgggt gctcagtgga actccagact ttgtggaggg ctctgtggtt ctgctgctgg 

84361 ggaaactgtg tgtttctgag cctatgggaa tgggtcagaa agcctgggaa atgggggaga 

84421 ataggagcag gaacacaaat gaggagccag caatgcaggt tgctattatg tcgcatgtca 

84481 cttcctacga atatcttgat agattagctt ccttcaggcc caaaaccttg aatggacatg 

84541 aaccacagta tcagggctaa atgagtaaca gcccaaacca ggtccactct ccagacagtt 

84 601 agaaaaggta gacatctctt gtagctggat ggagcagggg tctcccccag gggtggtaat 

84 661 tcagcaggtc tttaaagaat gaataggatt aaagtaagtg aaaatggaaa gctgaaggca 

84721 gggaaggaaa gtgaggctca gggagaattc tgggcaaggg aaagttacag agtgattcct 

84781 tagagcttgt gagtctaaca attttgatgt caatggacct tattcttgga accacagggc 

84841 aactgattga ctttagtttt cttttttgct atattcttgc cactcactca atgactgctc 

84901 attgaaaatt gatgataatg gccatatgga aagtctcaga gcttatctag gatttgagtc 
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84961 


cctgaggcat 


aatcagtggg 


tggaacagca 


gatggatggg 


gcagactatt 


tttctgttcc 


85021 


ttcctgtgac 


agttgattat 






ctctttgtga 


tcttagcttc 


85081 


caaaattacg 


gaaggttatc 


actatttatt 


attactactt 


tctcagcaaa 


ctcaaagaat 


85141 


cagagtgata 


ttgattattt 


agatttagca 


atgggggact 


taagctctta 


taaatcaggg 


85201 


tcatctgaat 


ctaaagatgt 


atgtctttct 


tatttcagtc 


tgaccagtta 


tttaagaatc 


85261 


caagatacct 


ttacttttta 


tttgaagttc 


cttaatttga 


taagagctct 


accaagcagt 


85321 


tggttctgtt 


tctctttcag 


ttctctgcca 


agcttttgtg 


tgtcttaaga 


cactgggact 


85381 


gggaaaagac 


tgcagtgttt 


gttaaatgta 


aa'tta'tac'tg 


agccttgagg 




85441 


atggggagag 


gtacattttg 


gctttccttc 


ctttgtaaaa 


tgcagttaaa 


ctgatatcca 


85501 


gaacaatggt 


gagaaaacac 


caattaactg 


aatgtatgaa 


agacagtaaa 


ataaacaagg 


85561 


tgggcagtcc 


tgtatgtgag 


gcatttttaa 


actctttgct 


gtgtaagtgc 


aggcttttgg 


85621 


aaagatctgc 


ttttcacttt 


gatttttgca 


gtcccacacc 


agcttgcttc 


tggttctgcc 


85681 


ttcagggaca 


tatgtgctct 


ctagatctgg 


gaatcttttt 


gtctggagac 


ctcaagattg 


85741 


ggatctgcac 






tgaactctat 






85801 


ttgaggcttt 


ttgcaagctt 


tcaactctga 


ggtattttgg 


aaaactgtaa 


ttttgtttgt 


85861 


tctctgctgt 


atgcatgtgg 


tttcttaaaa 


cacgagtgtg 


attattttct 


gcatcctttc 


85921 


catagcaaac 


atttccctca 


aactacatgc 


tccaaaagta 


ggttctcact 


gcacttgcca 


85981 




ttgcttattg 


agagtgactg 




acaaatcaac 


tggttacaat 


86041 


tttgaggctt 


tttaggtggc 


ttccaaactt 


gtcttggggt 


ttgtccctca 




86101 


agccaaacat 






cccagcaagc 


cataaaacac 


tattgactta 


86161 


ttatgaaact 


taatttgatg 


ataatgaggc 


aggtctataa 


aatagcactt 


tgggtagttt 


86221 




aatagcaatt 


tgtggaagac 


cgttagtatg 


tagcatggac 


tctgaagcta 


86281 


gagaacttga 


gtttgagtct 


cagcttttct 




tctagtaatt 


ttatttattt 


86341 


tatgttttac 


tgttaaattc 


ttaacctacc 


tggaatttag 


tttggtatag 


ggagtcatgt 


86401 


ttggatctag 


ctgttgattc 


tttttaaata 


taatttttag 


tgattttttt 


attgtgataa 


86461 


aatatacata 


agatttatca 


ttttaaccat 


tttaatttag 


atgtacaatt 


cagtggcact 


86521 


aaacacaatc 


acaagttttc 


cactttttag 


ctgtgttaga 


ggtcaaatta 


ttttaccttt 


86581 


ctgcgtctcc 


gtttcctctt 


ctgcaagtgg 


gaataatact 


cccattaagt 




86641 


ttaagactta 


attcaggccg 


ggcgtggtgg 


ctcacacttg 


taattccagc 




86701 


gccaaagcag 


gtggatcact 


tigagg'tcagg 


agttcgagac 


cagcctggcc 


aacatggtgg 


86761 


aaccccatct 


ctactaaaaa 


tacaaaaagt 


taactccgta 


tagtggtgca 


tgcctgtagt 


86821 


cccagctact 


caggaggctg 


aggcagcaga 




cctgggaggt 


agagtttgca 


86881 


gtgagctaag 


attgtgccac 




cctgggtgac 






86941 




aaacctaaat 


tcataggaaa 




cttgatgtgt 


acaggacaaa 


87001 


ataagtgctt 


aataatagtt 


gatggccatg 


aggatttgtg 


gtactggcag 




87061 


gcLagttaag 


aagcattttt 


tgagcctggt 


grcatatgtc 




atatgcctgt 


87121 


gtggtcacag 


taattgccgg 




gtqqaqacct 




cattggtaac 


87181 


actggataat 




tttttatgtc 




tgagactaat 


atacagtttg 


87241 


aacttcaggg 


gagaaaagaa 


gattttctaa 


cagatttata 


tttcaaattg 


tttacaagac 


87301 


aattgaaaat 


gaaaagataa 


aacctactta 


ctcttcaagt 


tcaaggcata 


atgcaagagg 


87361 


cttgaagagg 


gtagacttct 


gaatatattc 


cataaacagt: 


atctcttttc 


agaggttttg 


87421 


agaaaaccct 


tgtttaaaaa 


aaatatgagc 


aacaattatt 


tgttattcat 


gaattctaga 


87481 


tgtgtaaatt 


gtgtgatcaa 


aatgagagga 


aaaagagaag 


tctcaagaga 


ataagcattt 


87541 


tgtctacatt 


taagtctttt 


ggaaatgata 


tgtttgaaaa 


taatataccc 


gtitaaatiata 


87601 


tttccttaaa 


ttatttagat 


aaactgtttc 


tattgtagtt 


tctggtaaac 


agtaatatga 


87661 


aatggtttct 


attttattgt 


caacaacttc 


attacttttt 


aatgcagcaa 


aaagcttcca 


87721 




ttaaaaagac 


tgcagaaatc 


ctctgaattg 






87781 


aacattcatg 


cttccacagt 






accatcacaa 


aaaatgtctt 


87841 




gctgccctag 


ctaaatgtga 


actttaaaaa 


ttggaaaatc 


aggcactttg 


87901 


tagacagttc 


actctttgtt 


tcatattttt 


gggctaagga 




cccaaggaag 


87961 




aggagtctaa 


cttctcaata 




tttagcctac 




88021 


ttgttttggg 


aattgatagt 




ataacaaaag 


aatattgatt 


tctaaagcat 


88081 


ggatcgtgtt 


ggttcctaaa 


aagtgagtga 


taatgtagca 


attaaattct 


gccctgtatt 


88141 


atacacagtt 


ggtcagggta 




gccatcatca 


ttattgaatt 


gtgtaataaa 


88201 


tacttggggc 


agagtgccat 


agtggaaata 


aaactacatt 


ttggtttagg 


acttcctgcc 


88261 


ctagcccatg 


gcctatagag 


ccaagccttc 


acgtttcaag 


aagttgtgag 


gggaaagaga 


88321 


caggcattgg 


cctcatggta 


gcagctgctg 


gagggttctt 


ttcctagctc 


attttctttc 


88381 


ctcatgtttc 


acatttttcc 


agggtgacat 


cacgtgtggt 


aactatttgt 


atggatcttc 


88441 


caagcccgca 


ttttagccaa 


gacatttctc 


atgggctcca 


gacccacaat 


ctggtggtca 


88501 


acatactctc 


tacttggttg 


ctccatagat 


accctaagtc 


aaggcatgaa 


tatctaaact 


88561 


gatcgtgact 


ctcctctctc 


ccaccatcac 


ctccaaacct 


gtttcttttc 


ctgcattttt 


88621 


tttttttttt 


ttttgagatg 


gagtctcgct 


ctgtcgcccg 


gctggagttc 


agtggcacga 


88681 


tctcagttca 


ctgcaacctc 


catctcccgg 


gttcaagcga 


ttctcctgcc 


tcagcctcct 


88741 


gagtagctgg 


gattacaggt 


gtgcaccacc 


atgcccggct 


aatttttgta 


tttttagtag 
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88801 


agacggggtt 


tcaccatgtt 


gttcaggctg 


gtctcgaact 


cctgacctcg 


tgatctgccc 


88861 


gcctcagcct 


cccaaagtgc 


tgggattaca 


ggcgtgagcc 


accgcgccca 


gcctttcctg 


88921 


catttctttt 


gagcatctag 


cacgtttcca 


gtcattcatc 


cacgtcagaa 


agccccagat 


88981 


gccaccagct 


cttctttgct 


ttcaatgttt 


aatggaacac 


taaggtctgc 


gaattctacc 


89041 


tcacaaatgt 


cttccctggc 


gaatcctcct 


ttctgtgcca 


cccatcttca 


ccttatttcg 


89101 


ggccttcctc 


atttctcacc 


tggatctttg 


taaccatggc 


ctctagagct 


ggctccctgg 


89161 


atccctactg 


gccatcaatc 


ccatgtgcct 


cagccgtaat 


gccagtcaca 


gttctcaaag 


89221 


agctattctc 


ttcctccctt 


cctcattcag 


gctgtgcctg 


gtaatctgga 


gtgaattttc 


89281 


cactccatcc 


tgcagacttc 


caaatactta 


tccagaggct 


cagcttacag 


ctcaatgcct 


89341 


ctttgaagcc 


tttctcaacc 


tttgcagagt 


tgagatataa 


ctatgctgct 


aaatgtaaat 


89401 


ctaactcttc 


atacattttc 


tcttgtttac 


ctgtctcccc 


ctcctgtcag 


agtctgagtt 


89461 


acttgaggtt 


gtgaattgag 


tcttatttat 


gtctgtctct 


aatatctagc 


atggtgtgtc 


89521 


tcattgatag 


cagacatcta 


taaatgagtg 


agtaagtgag 


ccaacgtgtg 


ggtggttttt 


89581 


cctggggcgg 


gaagggttgg 


gagggtacca 


cgtagtcagt 


gtgagcctcc 


cttcgggctc 


89641 


tttgactttc 


cgatgagcat 


gcttactggt 


agtgacttcc 


cttgaccctt 


accagttaga 


89701 


tacatgctct 


gggaccagag 


ctgcagaata 


taaaccagta 


gttttgaaaa 


gttattatta 


89761 


tctccataag 


gattgggttt 


tttataggca 


caggtaacaa 


ggttgaacac 


aaccaagtga 


89821 


acccagtgaa 


ttctatttat 


ttcagLgagg 


tgtctgtatg 


cactgtggcc 


atcccagagg 


89881 


ccttactttc 


aattcatatc 




agggggcttt 


actgccaccc 


agcactcaca 


89941 


ccgccgtgct 


cttctgcaga 


gactttcccg 






actttctcct 


90001 


gcacttaaac 


ctttagcata 


tctccatccc 


tgcctctcag 


ctaacggggc 


ttgcaggcaa 


90061 


ttagaaagag 


cttccctgtg 


tacctgccac 


cacatcttct 


cagttgtctg 


cactcgctgg 


90121 


ctagtctccc 


ttgctgtgga 


ggaattgttg 


atgcccctgg 


gaaggccaca 


ccctccactt 


90181 


gaaagatccg 


gtcccctccc 


accttttttt 




ttttgagacg 


gagtctcgct 


90241 


ctggtgccca 


ggccagagtg 


cagtggcacg 


atctcagctc 


actgcaagct 


ctgcctccca 


90301 


ggttctcacc 


attctcctgc 


ctcagcctcc 


cgagtagctg 


ggactacaag 


tacccgccac 


90361 


cacgcccagc 


taattgtttt 


gtatttttag 


tagagacggg 


gtttcagcat 


gttagccagg 


90421 


atggtctcga 


tctcctgacc 


tcgtgatcca 


cctgcctcgg 


cctcccaaag 


tgctgggatt 


90481 


acaggcgtga 


gccaccgcac 


ccggccccct 


cccacctttt 


taagaacatc 


actccatcag 


90541 


ttgtctcctt 


tctcttctga 


ataatcagtt 


ttcccctggc 


caccaagcta 


ttcccatcag 


90601 


catacaaaca 


tgctgatttc 


ttcccccata 


ccatgcccct 


ctccagcttc 


catttatctg 


90661 


ctctgtttta 


ctttgacttg 


cctgaactca 


ttataactta 


ttcctctcct 


ttcagtctct 


90721 


ctctttgcaa 


agtttttaaa 


gcttttaatt 


ttggLaagac 


atatacaaca 


caacatccaa 


90781 


cttaccatgt 


caaccttttt 


tttttttttt 


tttttttttt 


tttgagatga 


agtctcgctc 


90841 


tgtcacccag 


cctggagtgc 


aatggcacga 


tctcggctca 




tgcttcctgg 


90901 


gttcaagcga 


ttctcctacc 


tctgcctccc 


gagtagctgg 


aattacaggc 


atgtgccaca 


90961 


tgcctggcta 


atattttttt 


gtatttttgt 


agagatgggg 


tttcaccatg 


ttggccaggc 


91021 


tggtcttgaa 


ctcctgacct 


caagtgatcc 




gtctcccaaa 


gtgccaggat 


91081 


tacaggcatg 


agccactgcg 




ctcaaccatt 


cttaagtgtg 




91141 


atgtcaggta 


catttacatt 


attaagccac 


caatctgcag 




tcttgcaaaa 


91201 


ctgaaattct 


gtacccagta 


aacattaaca 




ccctcctcca 


agcccctggc 


91261 


aaccatcctt 


ataccttcta 


tctctgtaaa 


tttgactact 




gtccattctc 


91321 


tcctgaaccc 


accttcatca 


ggcttttgct 


cctacattcc 


ccctccacca 


cttttctcag 


91381 


ggtcacctgt 


ggcctccact 


tgctggatct 


gacagtcatt 


tctcagtctt 


tggctgattt 


91441 


ggcccggcag 


cagctatagg 


aacagtggag 


cactccctcc 


tctttgaaac 


actgtcttca 


91501 


cttggtttcc 


agggaccttg 


ctggcctgtg 


ttttccttct 


tcctcacaga 


ccactccctt 


91561 


ttagtgttct 


tcctcttgtt 


gcctctgtag 


gttggaatgt 


cccagggctg 


agtgcatggc 


91621 


cctcttctct 


aatcttcact 


cactcctctt 


tagagttgat 


gtctttccat 




91681 


ctcttacagt 


tcccaacatt 


tgtcccgagg 


cccagctgtg 


aacacttcct 


tacctgcata 


91741 


tctaataggt 


gtctcaacat 


ttgcatctcc 


agaatgactc 






91801 


tgccataggc 


ttccacatct 


gaggatggca 


actccgttct 


tccaatttct 


gaggccaaaa 


91861 


ttgggaatca 


taattgacta 


ctctttttct 


ctcaaacccc 


catccaatca 


gcaaattcag 


91921 


ttggctttac 


tttcaaaaga 


tacaacctga 


atgtgaccac 


ttcttgtcac 


ttccactctg 


91981 


ccactcaggt 


ctaaccacca 


tcttacattg 


cttgggttgt 


cacctcctcc 


ctggcctccc 


92041 


tgcctgccct 


agattccaaa 


cacaaatcta 


gcgagaagag 


cctatgaaaa 


tgtgagccac 


92101 




aagccctcca 


gtggccagga 


aatggcagac 


ttttacagtg 


gccgaaagct 


92161 


ctatgtgatt 


gcctactcca 


tgacctcatt 


ggcctctttt 


cttgattttc 


tcacccttgt 


92221 


cccctcgctg 


gagtcccaca 


gccttccctg 


gtgtgcctgc 


agtatgccat 


cctgctgcta 


92281 


ccttagggct 


tttctgctct 


ctgtccctct 


ggggaaaggc 


tcagccctta 


aggatcccgt 


92341 


gccttcctct 


cttctttctg 


tcagctctgc 


actcagctgt 


caccttggtt 


gcgaattctt 


92401 


ccctggtcac 


cctgtttaaa 


gttgccacgc 


ttctcccgca 


cttcgtagtt 


ttttttttct 


92461 


tMttattttt 


ctttctagcR 


cttaccaccc 


tgtaacatgc 


cgtatatttt 


acttgtcatt 


92521 


tgtgtttgta 


tcaccccact 


ggagggtgag 


ttctagagag 


cgggatttcc 


ttgccagcgt 


92581 


tcgctggatt 


tgcccactgc 


ttagaactat 


ttctggcgca 


tagtaggtac 


acagcaagta 
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92641 

92701 

92761 

92821 

92881 

92941 

93001 

93061 

93121 

93181 

93241 

93301 

93361 

93421 

93481 

93541 

93601 

93661 

93721 

93781 

93841 

93901 

93961 

94021 

94081 

94141 

94201 

94261 

94321 

94381 

94441 

94501 

94561 

94621 

94681 

94741 

94801 

94861 

94921 

94981 

95041 

95101 

95161 

95221 

95281 

95341 

95401 

95461 

95521 

95581 

95641 

95701 

95761 

95821 

95881 

95941 

96001 

96061 

96121 

96181 

96241 

96301 



ttccttgaat 

acttcagcct 
gagaacaaag 
atgtatagat 
gtgacaaata 
catataaaat 
aattatattt 
aacttgtggc 
cctttgaata 
ataccatatg 
ccttcattct 
cttagtggaa 
tctttaaaaa 
aattttatct 
cattgtatca 
atctaatcaa 
acctttgcaa 
agtatataat 
ctgtttcacg 
tttctggttt 
tttatggtta 
ttcttgattc 

atcccagtgt 
ggattcattt 
attagtaact 
aagagactgt 
ataatcatgt 
tgagacctct 
tgctatctgc 
tgttcactgc 
ctgccaggaa 
tgtactcatg 
ctttctgcac 
actgtcattt 
cttcccagtg 
agggaacaga 
ctaatttagt 
aactattatt 
gcagttattc 

acagatgcag 
agtggtggga 
gtttccagca 
atgggctctt 
aataaatatt 
tcctaatctt 
catgactaaa 
gtagaacaga 
aagacctcat 
ctttgttgat 
actgcttaat 
caggggtgag 
ggcagttctg 
gagtgggaag 
tggatagaaa 
agggatgaat 
cttcaccaag 
cctcagtact 
gagtgaatga 
tctgaaagta 
ctggtaatga 



gactatagag 
cctaaactgt 
ctgaattaaa 
gaaattataa 
tagtatttgt 
ttactttctt 

aataattcct 
gcttttgaat 
gtataaattt 
actttatgtt 
ctataattta 
tatactgata 
ttgctaactg 
aaataaaaaa 
tcacatacgt 
ggggggtggg 
aaaagaaaat 
gtatatttga 
ggtcattttt 
agagtttcaa 
taatcctgga 
ctctgccttt 
agtaatgtaa 
tctttgcatt 
agttttcttt 
taccagctgc 
gatttagtga 
gccctccctc 
atgattgctg 
ccagggtctc 
gccctttggg 
tttattccca 
tcagagagta 
gttgaactcc 
tttataaggt 
tacattgatc 
ggaaagagaa 
tatttatttg 
ttagagtgct 
cagtcctttc 
aaattgaggc 
ccagagtttg 
agataggaca 
tgttctggag 
caattcaacg 
catggagctt 
tcgcattgct 
ggaagaaggt 
ggttcatatt 
ctcttaagag 

tcagcccatg 
aatgtgtgcc 
tgagggtcag 
gatgtagctg 
tgtccaaaga 
gcagtggtcc 
gggaaaacag 
cagattcacc 
aatttccacg 
ttttgacttt 



aagaagcttt 
aggcttcatt 
cattgcttca 
tttaggttgc 
attatataca 
atgggacatt 
ictggca 
cctttgactt 
ttttatctcg 
tgaattctgc 
taacacaagt 
ctgcaggtaa 
catttccatc 
aagatcttac 
agtttgatat 
ggacgtgctg 
tagttttggc 
ctaagaaatt 
cttcaacagc 
attataaaag 

atgataacat 
tttctgtctc 
atttgggcat 
tccgaaatta 
gaattatggt 
tcagctttag 
aaggccattt 
tcacacccca 
ttggcctgct 
aggatccctg 
ctcttctgca 
tacttctctc 
tcagaggagg 
agacactgct 
gcaactgaac 
attgaatgga 
accttcaaac 
tgtctcatgg 
tcactctgtg 
actaattcaa 
atagagaaat 
aactcaagct 
gtgcccatct 
aaaatattag 
aatatttatt 
acgttctagg 
atgttagaag 
taagatggtt 
ggagaattgt 
atgcaggctg 



cacaaaacag 
tgaaatttca 
aacgatatct 

agaaaattac 
aaatatatta 
atacctttga 

tttgaaatta 
atgatttcta 
atttggaaaa 
aaaggaggat 
ccaaattagc 
atggcctcat 
gccactaaaa 
tttatactac 
tttccttatg 
gaggattatg 

gctttctatt 
attagcaact 
accagcaaag 
cctgtagaga 
gcagaaagtc 
ctcaccttgt 
atcacataga 
tcaggcaata 
gagctctaaa 
gagaatccct 
cggccaggaa 
tcctaggcag 
cccttgggga 
tagaatttaa 
cctggtgtcc 
aactgctttg 

gctagagaaa 
taacttacag 
aacccagtaa 
gcaggcactg 
tggagccagt 
taagtaatta 
gtctgacatg 
tgggctgggt 
ggacctggtc 
aagctcttac 
aagggaagag 
gtt 



aaaagcgtaa 
agttcttatt 
ttgagtcgtt 
tttatacata 
tatggatttg 
cagtagcttt 
gggttacttt 
atgttaaaac 
accagttact 
cttcaaaatg 
cctgagtagt 
tcatttcttt 
aagtctgtta 
gcctaagatt 
ggctttttta 
aaaacctcta 
tttttatttg 
gttaactgaa 



gaa 



latcgatt 



ggttctgctt 
ccatgggtgc 
acaccgagga 
cgattcttct 
gggcacatga 
caggaagagt 
caggttccat 
ttttacccag 
ataacatgtt 
caagatgaga 



:atg 
atgaaagaag 
tgatcacatt 
agatcttgac 
ggaagcgttg 
tattggctgg 
tgagttatgg 
cattgcggag 
agagagaccc 
ccctagatcc 
agcaagaggg 
gcacttttgt 
ttattgagac 
tatgRtgtca 
tcttctcttc 



ttccagcaaa 
cacagcagtg 
cagcagccgt 
gaagtccact 
aatgtgcctt 
agcagttgtc 
ctgtctgtgg 
aggtatttgt 
gcctcacagt 
agtctcctgc 
gatggtccga 
agtcccaagc 
ccagtgtccc 
gtcatgagct 
tagttatatt 
catgcatcat 
agttggcact 
actgttaagt 
attgttcact 
tagctaacat 
ctctaagggc 
actattaatg 
cttccaagtt 
aggacctgga 
ggaaacatgg 
agagctcact 
tgtacaacaa 
agaaaattca 
tgctaagata 
ttttaaggag 
tctgttgttt 
tcactcaaaa 
ggttgtttgg 
gtagcactcc 
gctgggtggg 
ctgagagagg 
ttgcctgcct 
tacctctacc 
tggagataac 
ggtgggccag 
atcatgagga 
tttggtcttc 
tgaccataaa 



ttttggataa 
tggcacggaa 
gatgtaatta 
atgcctgctg 
ggttatctgg 
atgttggaag 
gagggacaat 
atttactaca 
aataggagag 
gatcctcttt 
ttatttgtga 
gtgtaataga 
ggaaaaaaac 
gaacatttct 
ccttgtttca 
agcgtccaca 
cttttctctg 
agcatccttg 
agatccctta 
tctggctggt 
tacacttcaa 
ggaaagtttc 
cttcctctca 
cacacacccc 
agtgttgtac 
tacagtaatc 
gtgtgtctcc 



attgtai 



itgat 



tttatatgta 
tcctcatgtt 
catgcagcta 
accagcctgg 
cactgaagac 
tacttttagt 
tgaacaaagc 
catctatcta 
aaaagaaaat 
atgatcaggg 
gacctgactt 
cttgtaggga 
tctgcagagt 
cttactggRg 
gtgtctagag 
aagttcagga 
aggtgaagat 
caaggactat 
agagcagccc 
ggctttagat 
gtgttaagat 
ctctgtggaa 
acaagatgtt 
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96361 gcaaacgagg ttcccagctg tcctggtctg tcggcatggc tgctcgatgg ctctggtggc 
96421 tgataactca ggatgttgtt gctggagagt ggggaactcg tccagatcct cctgagacac 
96481 cactttcctt tttcatacaa 



ERG genomic sequence (SEP ID NO: 4) 

>21:38783451-38882000 



1 


tgacagattc 


taaagtgttg 


tttcaacagt 


ttcttcttcc 


tcaaaggtga 


gcatcatgtg 


61 


cacactgtca 


aaattaaaaa 


aaaaaaaaaa 


aagacccaat 


aaaacccgat 


gaaacccaaa 


121 


gagcatggag 


atgagaaact 


cactgctggc 


tgacgatctg 


aatgaatggc 


cttgggggga 


181 


atgtgaatgt 


ggaggtagca 


gaagggtcct 


gtcttcagtg 


cttgattctg 


Watcaatact 


241 


tggtaaattt 


aaaaaacaaa 


aaaaccagca 


agggagattt 


taaatcatca 


aggcagaagg 


301 


atttaaaaag 


tggaatgtgt 


taaatgcaag 


agtgcgggat 


ctttgcataa 


agttattata 


361 


ttttcaaaaa 


ttcgacaagc 


attgttttta 


ccaaacgtgt 


ttaaagtaac 


agccctgagg 


421 


ttaatgttat 


gattggattc 


tgatgtggca 


ctttaaaaat 


aaatcatttt 


gcttggcctt 


481 


agcatttctt 


ttcaaaatgt 


ctctgtactt 


atctgtgtac 


aacagcagat 


taccgttctt 


541 


cccagagttt 


aattccatga 


acaccctgct 


gaaagacttg 


ttaaggcctt 


atggaaaact 


601 


ctgacctgcg 


tgttcattat 


tgacgcccaa 


gacgagtgca 


aggaagttga 


tacttgggat 


661 


gaggaaattt 


aaatcagcag 


ctgatctgga 


atctggaagc 


actgcccctt 


ggatctcgcc 


721 


aaatatggaa 


agaattcttg 


tcactgagca 


aatatagaat 


tcagtgctag 


agttttacct 


781 


cttgaatttt 


ctagaacttt 


gttttgatta 


gccacctaga 


gtataagaat 


tgctgcctat 


841 


gggatttttc 


ccagaagagt 


agataacata 


aagtaaccag 


cRataataat 


tttcattatc 


901 


caattatttc 


atgtgtctcc 


ataacactta 


atacatcatg 


tttctctgca 


cagtaattcM 


961 


ctatgtacag 


ctagggtgaa 


tagcaggaaa 


aaagagatca 


gatgataaag 


agaagggaga 


1021 


cagatttgtg 


tgacttgtat 


aagatacttc 


gcctctctaa 


tcctttgtcc 


ttatctgtaa 


1081 


gaaagcagca 


gtacccaggc 


tgctgtgagg 


atcacatgag 


acattatgag 


agtttagcac 


1141 


aggacttggc 


acataacaag 


cagttattag 


gatgatgtta 


gtttggatta 


tgaMttcaag 


1201 


atactacagc 


tacatttttt 


atactttctt 


aaacatcaat 


ttaagtcaat 


tttaattaac 


1261 


gttattttac 


taggagaaat 


gtctctaaaa 


acataagaat 


tctatttgaa 


gtctacaatg 


1321 


ataattacat 


tactctgata 


tcttaacttc 


ctctttatga 


tttcaaataa 


ttaaaattta 


1381 


agtttatgtt 


ttcaacaagt 


gcttattgaa 


cctctactct 


ttgccagtca 


ctgttgagga 


1441 


cacagaagaa 


atggataaga 


tatcactaaa 


aacagcttct 


gcgatatcag 


tgggtataca 


1501 


taaattattc 


acactcaaca 


acaaacacaR 


gcaaagctgg 


gagaatgtgc 


tgacaggaaa 


1561 


gaaaatgaag 


agtgggagag 


tggacagatg 


ggaatcgatg 


tgaaatttta 


taaaacatgt 


1621 


cttggccaag 


ctcagtaggg 


gaaatatgtg 


gaaagtaagc 




agRgaataaa 


1681 


agtataaata 


agaaaaaaaa 


tccagaggtc 


acagtggacc 


actagccaaa 


atgggttcac 


1741 


agcaaacaaa 


caaaaaaatg 


taatgaaata 


ctaaaattaa 


agaagtacaa 


ataaaaatga 


1801 


gattctgttc 


ttatctgtca 


gaatggcaaa 


gattcaaaga 


gtaaaatatt 


cagagttgtt 


1861 


atgggtgtag 


tgtggaactg 


gtctcatatg 


tggctagaaa 


aagggagaac 


tggtataatc 


1921 


cttccagaga 


ataatctggt 


aattattttt 


taaattggca 


aatatttaaa 


tttggcaaag 


1981 


acttaaaaaa 


tatatatacc 


cattggctca 


ataattcctt 


tctaggcacc 


aatactaata 


2041 


aaaagtattc 


aaaaaattaa 


atagatggtg 


tactgcaata 


taatgagtga 


aacatYgcaa 


2101 


gcagcatcac 


tttatacact 


aggatattgg 


ttataaaatg 


gtaatacata 


ttttagatag 


2161 


aatataatga 


aacaaaaatt 


ttgcttatca 


aaaaatgtta 


gtaacatggg 


aaaatgcttg 


2221 


taaaacaatg 


ctaactgaga 


agagcaggac 


aggtaggtag 


atacaaggcc 


aaacagatac 


2281 


acacRgtgga 


gagacatatc 


ccaattgtta 


ggaatttgct 


gtggttggtt 


aggatctgaa 


2341 


caatttatat 


tgactttgtg 


atgactttta 


aacgttttct 




aattttttat 


2401 


gtttataatt 


tatgctaaag 


aacaagatgt 


attttttgta 


attcaggagt 


aaagaggaga 


2461 


gtggggtatt 


ccagggtgct 


gttcctgccc 


taaccaggag 


cacgatgcag 


tcataggcct 


2521 


ctgccctcct 


ccagcagcct 


cggctcagaa 


caggacgaag 


ccaaccctct 


ctgaccagca 


2581 


ccacctcggg 


cgacccctgg 


acgctccacc 


tgcccctgtg 


tgtggtacct 


gccctgctac 


2641 


acctttaaca 




caagctccct 


cctccttcct 




gtggctgccc 


2701 


agacccagat 


gtaccctgcc 


tgggtgtggg 


aggccgcaca 


tcagagcagt 


ctatggctgt 


2761 


gggcttcaca 


gctgaagctg 


ggccgtggga 


aatgttgaga 


cgagtgagtg 


cagagcctga 


2821 


aagacaatac 


ggtagaaaag 


gttacatccg 


cgggggggcg 


tctgatggca 


tctgccattt 


2881 


attgaggatg 


ggtgcgagac 


agggtagcca 


ttatgatccc 


attttacagg 


tgacgaaact 


2941 


gagatcagac 


aggctagcta 


ccttgcccag 


ggtcacaaca 


ggaaggaagc 


aaatgcaagt 


3001 


gtgtctgcgg 


ccacgtaagg 


tgagagaagc 


gctatcaggg 


gtggtctgac 


agaatcccca 


3061 


ggccagggag 


cagttgtcct 


ctatcctgtg 


gggtcaaagg 


aggggacaag 


aaggctctcc 


3121 


ttgaggtagc 


tccaaagaaa 


agaccccgcc 


agagcacagt 


gtgctaccaa 


acgctggaac 
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3181 


agcctggaga 


ctgtccctgt 


ctcttctctg 


aggtttctca 


tttcagagaa 


actctcatcc 


3241 


gtctggcttc 




ttactctgtg 


attctcttat 


gtatcagctg 




3301 




aattgttggg 


atgcctataa 


tggctttggg 


tctccactcc 


cacagaggga 


3361 


atcttattct 


aaagttctaa 


atgtatcagt 


catattttcc 


tgaagacaac 


agaaatggtt 


3421 


gtttcattta 


cttattttta 


tgtatgcttt 


ttcaagagtt 


gttaacctgt 


gtttgtgttt 


3481 


aaataaagat 


ctgttttgca 




tttatgagat 


aatcacttat 


aaacgtagat 


3541 


aattaagaaa 


tggttcaggc 


tcagcacatt 


gtctgagagc 


attgcagtcc 


tgcactgtat 


3601 


ttcatggggg 


aagaaaagcc 


cacagatgga 


gccattctca 


atcctgcttt 


ccagggtcac 


3661 


tgagtcaggg 


tgcctggcat 


gcatgggtgc 


at:gcacacac 


aggcagaggg 


ccagggagaa 


3721 


ctgtgctgtg 


tccacagagg 


gcagcctgga 


tcacagagct 


gaccaagaac 


ggaagcagca 


3781 


agttgtttta 


aagtagttgc 


ctgtaaagcc 


acttttcggc 


aaggacaaac 


actcagagca 


3841 


tgctctgaaa 


tgacttatgc 


acagcagggc 


agcggcattt 


tgcccctggc 


cttccttcca 


3901 


ttctcccgga 


atccccctaa 


agtaacaagg 


actgacgcaa 




acttagccaa 


3961 


caggttgatg 




aatctgtagc 




attaatcaca 




4021 


ttctgtgagg 


agttaacttt 


cttcatataa 


ctcttagtct 


taattaagtt 


tccctcacat 


4081 


gtgatagcaa 


acgttcaaga 


agtgcaaatg 


tgcagacgtg 




tgcaagcttt 


4141 


atgtctttaa 




ttgagtccta 


tcaggtcgtg 


tttcaattac 


catcaaacct 


4201 




gttaattcaa 


aaaaaagtaa 


cttctttagg 


caagtcatgg 


aaatagcact 


4261 


agggtgccct 


caggtctcca 


ccaagccttg 








4321 


taagatgctt 




ctgctgccgg 


ctgctggctc 


tctccacatc 


cttcaatgtc 


4381 




gatgatggaa 




attccctttt 




gtatctttct 




aggatcagaa 


cttgcaacct 


ggggttcatg 




aaggtccggg 


gagaggactg 


4501 


tatttctgtt 


gtaattctat 


gtatttcttt 


ttaggcacca 


gaaagtatta 


ttctgagaaa 


4561 


ggactgtagg 


cttcactaga 


cttcccaaat 


tgcagatggc 


attaaaaaaa 


aaggaatcag 


4621 


gaaagtcttt 


gggaaattag 


gctaaaactc 


ctaactgaag 


cacaat:cgag 


ttttcagtgc 


4681 


aacagcagga 


tccctgcagt 


ctgtagcaaa 


cactgagatg 






4741 


ggtgaggttc 


agcacctctc 


aatattgtca 


gatgtcacca 


atgcccaagg 


gaagcaccaa 


4801 


aaccaatcag 


tgaattgtct 


gcaaaccatc 


agctttgttg 


cttccaagtc 


tttactagaa 


4861 


ctcttaagct 


gatcttaaaa 




cttagagata 


aatgaacact 


ctcacataag 


4921 


ttcccaaaat 


tcctttactt 


cttccaattg 


aaacattttg 


ttcccaattt 


caagttacca 


4981 


agggaaaatg 


atggtaagaa 


tcacaaagac 


tggatacgaa 


tatttagctt 


ttcatagtgt 


5041 


ctcatagggt 


cttacaaatg 


ttaatgcaaa 






atgcagaaaa 


5101 


ccaacttaaa 


aaattcagag 


catctgatac 


ctaatcgact 


aattgattta 


tttactgaaa 


5161 




acagagtgca 


ttgagactgg 




tatggggggc 


tgtttcctaa 


5221 


ttccagtgga 






ctaacttggg 


ctttgcgcta 


acgtaattga 


5281 




aacagactca 


ggacctcctt 


tttccactct 




ctgtaaaatc 


5341 


atctcattac 


tttacccagg 




gaaacatgtc 


ttaaaggaag 


tcacctataa 


5401 


agagggccta 




agattaaaaa 


aaaattcacc 


tgtagtcatt 


ctaagggttc 


5461 




atacaataaa 




aatgtaataa 


aatgttaaat 


caagcatagt 


5521 


accttagaca 


agtgtaaact 


attatttcca 


aaaaagtgft 


cagaaaatca 


caaaatattc 


5581 


aaactttcaa 


aatagcttca 


tgtattcata 


aatgtatact 


tcaaattaaa 


cagtataaca 


5641 


catttggatg 


ttttgcaaaa 




ttctaaagga 


tggtctgtta 


tttcaatgtc 


5701 


ctttttaatg 


cactgagttt 


tttgggaaaa 


aaaaccacat 


aagatcgtac 


ccaagctatt 


5761 


caagatttac 


acagaatatt 


tgatttaaca 


tgcatatgaa 


taaggatgaa 


aataagaaca 


5821 


aaaaaataac 


attttgggtc 




gtaatttcta 


ttcatgtagc 


taYaagtagg 


5881 


gcacattaac 


tctcaatttt 


ataattcctg 


tcta.taggag 






5941 


atccttgaag 


ccaaagaaag 


aaaagtataa 


ggtgaaagtt 


cttggtacac 


agatttttaa 


6001 


tatattattg 




gagaaataaa 


agctgaactt 


ctcaaacaca 


gtttctatgg 


6061 


tacatatgag 


tcaaatcttc 


taatacaaac 


atttcaaatt 


atgaaaatac 


caaaccaaag 


6121 


agtatttttg 




ggcccaattc 




tccattttct 


tatgcacttt 


6181 


ccaacaaact 


aaaagcctta 


tgaaaacacc 


accaaatgct 




gtgcagcaaa 


6241 


actaaagaca 


tgcaacaaac 


acaattttct 




ctcagcatct 


gcttttgttg 


6301 


ctgatttttc 


acatttctac 


aaatgtcacg 


aaggcatggt 


ggcctttaaa 


acaactgaac 


6361 


ggacccccat 


tcaagactgc 


atgccccttg 


acttgtagtc 


accattaaat 


tggcttcatt 


6421 


tccaggaatc 


aggccatatt 


taggattgta 


cctgtgcaga 


tttacctcca 


cattaatctc 


6481 


tacatgctat 


ctactaaaaa 


cttaggcaag 


gaaatgcatc 


agaccaaaca 


ccccacagca 


6541 


cagagaaccg 


accggccatt 


gctttccaat 


ctccgcaaac 


ctaaccattg 


ctggaagaaa 


6601 


tcttactcac 


agtgcacaga 


cagtaggtat 


tttattgaag 


ataaacatat 


agtggaacaa 


6661 


accaaattac 


ccccatttga 


gttacgtgag 


cactcagttc 


tcagcgtgga 




6721 


atcaagtcaa 


catttgcgtc 


ccattaccag 


cagccacttg 


ccgagtatct 


cttcgcttcc 


6781 


actgggactg 


cctggcatcc 


ctgatgctaa 


ggagccactg 


aagagcctcc 


aaatgtctga 


6841 


cattcacaaa 


cgcatctttt 


gctttgaccc 


gacccttcaa 


cctctccgag 


tctgctgcct 


6901 


tttctcagac 


acacatccag 


gcaccgttag 


ggatagttag 


agaatctgaa 


aattcagaag 


6961 


cgctccgaaa 


agcctttcca 


aaagtaatcc 


acagcactca 


acagtgaatt 


tagaaacccc 
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7021 aatttttttc tgagtttgaa gtttttaagc cttgcggatg gttggagtag gaaaaaggaa 

7081 atttactagg cagtgcaaag gaaatcttgt tgtcctctat tgtggcagtg ggggtgttgc 

7141 ccaaccctaa cttatctgcc ttgataaagg aaaccaaaga aaagagtaac aagaacaaga 

7201 ttttgtcaaa ttaaaaggaa ccctttcctt accttaatag tgctggccat aatgcRatca 

7261 agtttattga tcgttaataa atgttaataa taattattgc ttctctctga ccagaaagta 

7321 gttttgatga ggttgtttag agcggatgag attgtgctaa gtctgggaaa tgaagtcagc 

7381 caatggcagg aagaggtttc tattggtcct ggctgtccag cccaaagaaa caggatattt 

7441 gggagtggag agataagaga ccctgaaaac aatgttgttt ttcttgatga tatgcagcca 

7501 ggagattttt tttttttaat taaaaaaaga aaaggcatca attgggatgg ggactgccac 

7561 agcaggtgtg accggtgtgc cgccgtgtga cacactgcac tgagaccaag gcaggatgca 

7621 gatgtgatgg gactccgcat ggcttcacac gggctgcaag caccttggag ccaaggcgtt 

7 681 gagggcaccc cactgccctg ggtgtcagcc cttcgcagcc caattcttcg cagaattact 

7741 aggacagagg acttgagctc ctttctccta aaaggaaact ttgcaggtgg agtttatttc 

7801 atgttaatag atggccatgt tcagtaacag ccattgcctg gctgattttt aacaacctat 

7861 atttattcaa catttcatat aagtgttcca gaacagtttc attttctcct tccaaatacc 

7 921 tgcacttttt atttgctcta caacaaagtt gttgaaaacg caaggactct aggcttacag 

7981 taaacacRaa aaataaagag gaaaaataaa ccttcctaag tcttgttttc aagtatttat 

8041 taaaacccaa ataactgaag tgactacaaa tgtcccggaa tatcagtgag ctggtctcac 

8101 tctgacagac atccatgttg cagacaacag atcctcatag aacttttggc tacccagaga 

8161 tgccatgtga gggccccatt acatgtctaa aatccaagct ataagttcag ggtcacaggt 

8221 cKgttttctc ctcgaaggaa gtacagcgaa tgcaggcctg aacattcctt agagggtttc 

8281 aggactttaa ttactcattt tcaggaaatt gcttggaaga aacattttgc tttgaatcta 

8341 gctacaggaa cgcaggaccc ataaagaggt gtggtctcaa Yaggcgaccc caagaacaca 

8401 atagactaaa tcctgagtca tctgacaatt cctggttgca gagctggacg ttcagtaaat 

84 61 ggatttcact cagactttag gccggcatgt gtcagagttc tgtccaccag cccaggtcat 

8521 tctggctttt attatatacc tctgatttat cacctcatgt taggacaaaa gaaggaggag 

8581 aaggagaaag ataagggggg aaagggagag gaaggagagg aggaggagga gggagggaag 

8641 agggcgagaa aaaatggaga aggagaaagg aaggacagaa gagagagaga aaggaaggag 

8701 gatgggagga aggagggagg aaggaaggag aaagggaggg agggagggaa agaaagaagg 

87 61 aaaagaaaag aaagagagta tattgtaaga aaatcattct gtggaaatca gaaYctaagt 

8821 tctcagcacc tacattatga gggggtgttg atttccgctc ttttcatctc tgaacaatga 

8881 ataagatggc aggctgatta aaattctgtt tcccNgaaaa tttcaaagtc ctgagctgtt 

8941 ttatctgggc agcttccact agaattctgg agtgcggagg agaaaagctc tctcagcttc 

9001 cctgagtgtc cttgcttttt gttctctcct aagaagcatc aatgtaaaat gttaactgtg 

9061 gcctccacaa cacatggcac ctgattatgc ctttaccaaa caccagcttt aaataagatg 

9121 aacgctttgc taatgaaata gccacggaag aaaatcctgt gtggtcccgc ctcacccacg 

9181 cctcgtggtt gctaatccag ccctcagttg ttccctgcag caggaaagat cagcatttct 

9241 actgggacct aatgcttcag tgatgatttg gccttaactc cctggttctt gcctaatgca 

9301 aatcaaactg tgaagagtgg gatttcttac ctctctattt tgaaataatt tccaacttac 

9361 agaaaagcca caagataata caccaaactc ctaataccct ttgcctggtt aactctgatt 

9421 gtctactggt gccaatgtac atcattttct ctgctttctc tcacacatgc tctctctctc 

9481 tcaatctctg tctctctctc ccatatttat ttatgtataa catacatgca tgcatatatc 

9541 catcattttc taactaataa aaatattgtt ttataagaga caaatttatt tattggatac 

9601 taaaaaagag aaaaacagat gctattattt gatcaatatt cRatccataa gttttactgc 

9661 gtctaatttt ggaaatgcat acaaacataa aaagtgatct aataaaaagt taatttgggg 

9721 atactgagtg gtaaattctg ttccaaaatt ttacaaagga aactcctcct catttctaat 

9781 gacaaaatgt ttgattgatt cctgttgtct tcaaagagaa taattcttga tgtttcaaat 

9841 ggcatcatag catccaagcc tggtaatctc tgactattta atgtaaagtg ttttaaaaaa 

9901 aWctttggct cttgatagga caaggcacat cttttttttt tttttttttt tttttgggac 

9961 ggaatcttgc tgtcgcccag gctggagtgc agtggcgcga tctcggctca ctgcaagctc 

10021 cgcctcccag cttcacgcca ttctcctgcc tcagcctccc gagtagctgg gactgcaggc 

10081 gcccgccacc acacctggct aatttttttt gtatttttag tagagacggg gtttcaccat 

10141 attagccagg atggtctcga tctcctgact tcatgatcca cccgcctcag cctcccaaag 

10201 tgctgggatt aaagacgtga accaccacgc ccggccaagg cacatctatt aactagaata 

10261 tagcagtgca gaaaccgttt cccaaatatt atcctatggg gaaattatct gaagatatag 

10321 atatcactct agctttcctc acttgtaccc caagtctatg ttattaaatg gcctcatccc 

10381 tacaccaccc agcagcctaa gaaccaggct gcctttccca cctgatacct cctctcccta 

10441 gggaaatatg ccctccaagc caccacgcgg accaagttag ggggcaatct ggcacacggc 

10501 ccctacctga ggcccgtcct gccatgagcc tatcagggca gggtcctttt tcactctcct 

10561 tgagatgcag taatcaccac actagaccag cacagcccat acagtagctg tggacagcRg 

10621 aacctatgaa taaagcacac atttcccttg catgcttctc tgccacaaac ccatcccctg 

10681 tgaatctgcc aaaaccttcc actagtctcc tcacatttta ctccaaagaa agaaacaaaa 

10741 taatataaca taaatatacc ataaattcac tgtcacaata aaagaaacac tagtatttat 

10801 ttatcctaaa atcatttctt cccattttct aaagatcgaa caagttctca ttcttgttcc 
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10861 cttactaact tctctgtatg gaatatgggg aggaaggtta ggagctaaga acactctgaa 

10921 tgacagYcac atggttctca atgtaataca tgcgcagagc aaagagagga gatccgtctg 

10981 aacctccaca tctgtgggga cacaacagaa gattcactca tgactcattc aYgtctttat 

11041 ccctgcccta tacatcccta caatggaacg cccttcctcc tcaaaaacat tgcaggaaaa 

11101 tgatgtcgta gatgaacact tagcaatatg gaaaactgct ctgatatttt acctggttag 

11161 ttaaaaaatc acagtatata ggagaattat atccatagac tcattttgtt aaaaataaaa 

11221 aggcagttgg ggatggccgt gcattctgtt ccaaagcatc tgtgaacctg aggaatagac 

11281 accatgagcg aagctaacct tcccgagttg aaaaatgaaa tgcaatggtg gcagacatgt 

11341 ccaaggaata ttatggggac tcgatccctt tatgaatctt gtcatagatg aatgtgtgga 

11401 gatgtcaact agttggcagc agaacagtat tggaacggtg gtaatacaag gaaatagtat 

114 61 catcatatta gaaactttgg aatgagtata aataatggct gttcaacaga aaaacccatg 

11521 tcccctctcc aaagggcctg tttcactata tgtaaaaatt aggtcatgta tgttttcata 

11581 ttagactttt tgttaaataa cctttttttt tttttttttt gagtctcact ctgtcgccca 

11641 ggctggagtg cagtggcgtg atctcagctc actgcaagct ccgcctcctg ggctcatgcc 

11701 attctcctgc ctcagcctcc tgagtagctg ggactacagg tgcccgccac caccctggct 

11761 aattttttgt atttttagta gagacaggtt ttcaccatgt tggccaggat ggtctcgatc 

11821 tcttgacctt gtgatccact cgcctcagcc tcccaaagtg ctgggattac aggcgtgagc 

11881 caccacgccc ggccaataac cttttgtaat agtcaaaaaa taaaaaataa ataaaaaggc 

11941 aaagtaaaaa tattgacagc tatatcttat aataccaatg gcagagaagc tttttgtttt 

12001 ttcccctctt cttctttggt tcacttatat tttcaaaaca taaatttgta ataaacataa 

12061 attatatcct aatataaaaa tatgtaagta acatgcaatg ttgttcattt ttgtaaatat 

12121 ctgaaaaata caggcatgtt ctgtaaaaag tgcagcccac aatacttgta ggtgtcaaac 

12181 cttttagcat gcccaaatgc ctgttctttt ttaaatgtct acagcaagca tagagctctt 

12241 tttttttttc attttattaa gaaaacccag aaacgctcaa gagtttcgtg gaggccatct 

12301 atgcggcatt aatccattca ggtgatatta atggcctttt cctgccagga actccagtgg 

12361 gcactcagga agccaggatt acagggctct atccagtcac cgcatgtctc ggcattcggg 

12421 aagcctatct cagtctcctc aaattctgca cacataaaac ttcagagcct gggagcgacc 

12481 catgaaacgc aggtttttaa ggacaaaaca ataggacaaa agtgctgtat attgtcctta 

12541 agatgacaca tgaatttaaa atqcatagtg tttggattat tcatgagagc cccgcaagaa 

12601 cgatgtccca ggggtttttg ccaggaacat gcctgagatt gtagataaga atcaagcatc 

12 661 tggggctgcc gcacaatgga aaactccagc actccatgga acttttccat ctgcagcagt 

12721 cggaggattt gcctgagaac atacgcggca tgaagacact ctcagccctc aagggcaccc 

12781 agtcagcgct gtttaaggac ggtttttctg ttcacagcat acttcatgat tacagcttca 

12841 ctccattgga tacaactatg tgagtatgtg tgtatgagtg tgtgagtgtg tgtatgaggg 

12 901 tgtgtgtgtg ggggggtagg ggagtgccat atcccccagc agcacattaa gaaataatcc 

12 961 aattacaatt taaataacca ttgtttcaac actttctcta agtggtgaat gtattcttca 

13021 gtctcttggt tgatctgaac taatagaaac caaggaaact gttatcacat accaaatcct 

13081 caacttctca acaccaagtt gcaatttcct taatactgaa gcacacgtca gttcagttct 

13141 ataagctccc atggagcaat gatttcaata aacacaattt cataagcatc ccacacgttc 

13201 tcaagtttag gccaattttg ctcttactca actccatatg actttaaaaa tgcaggaata 

13261 ttaaaaccat tatcaaggac tccaaatagg tcaatgtatt atctgtggtt ttaagaaaga 

13321 aacctaagag gagagtaatt attttgtcac cattatttag cttaaaataa cttttaaatg 

13381 tattgttcca attattaggc ttctccatat aatttggaaa ccattaaatg agtttcaact 

13441 tctttgtccg tataactggg tatatctgag catagatgct acagacatca cgttgccatt 

13501 ggtgcccata aggcttcgta tgcccacagg gcatataatt aagattcaca aaagcatctg 

13561 actggcatcc cactctaccc ccgactcaga tccaaagtat cctttcccag gtactgtccc 

13621 ctgtcaccat ccttggccaa attgagaatt tatcccaggt tgttgaaatt aattttgatt 

13681 ttgattctag tcagagtttt aaatgatttt ttaagaagtg tgtcaatctc acctataaga 

13741 tactaaacag catcacttat ttttgtctta atctgtcact attttatgaa ttatttttaa 

13801 agaaaataca gattacttct taaagaaaga tcataagtgg cactataata gcattcaatt 

13861 gatagaaatt gatgtagaag cctgcattaa taatttttcc tgctgctttc taaagttggc 

13921 ttttagtttt cggttgaggt gggctttaat tcagcatcac ctcatttgat gatttattta 

13981 ctaatttatt tccatcacaa atggaatcag aaatgaagcg aaggcaaaaa atctttgcca 

14 041 caaattcatt tatatttgca tgagtatcac caggctacct gcggatgcaa tgaagcccta 

14101 tacccgtgcc tgaagtcagg agcacaggac actgggagtc tgtctcagcg cagtaactgc 

14161 tacatggtac gctccacatg gtatcctctc cacctgagcc ctcctcatcc ctcgccccct 

14221 ctccacacct caccctctcc acactgtatt atctcccact gcaagcaacg cagaggacac 

14281 agaggtgcag ctgaagtgaa ggcaggccag ttcttttcca attcactctt atgaaaatgt 

14341 gttaaatccc gttaaatgca ccgacctgaa ctgaaaYgat gatatcagta aatttgttgt 

14401 tgtcatctga tgcccacaca gatctcagca catgttttgg gaaaggctgc agtgttcaag 

14 4 61 gagtactaga aatgttcttt ctgagtggac tggccatgat aagccaaaga atattatagc 

14521 cgatagggat aaatcatggt tttcccactg cgcggttgcc tggtggagat tattccagcc 

14581 tttatttaca gctacaaaaa taatcctgcc tcaaggattt taggaagcaa ccacctcctg 

14 641 cctaaagaac tacactgcat tgttgacttg ccagctagaa accagacatg agtcgtgcag 
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14701 ggaagttccc tttgaacggc ttgaattgtt gtcactagag ggtgctaaat gcccctcaaa 

14761 aaggccttta gagagaatac tagtgtgcct aattcttgat ttaaattcct tgaattgaca 

14821 ttattttaca aagtggtctc attttttact tgttaaaatt aagatctaac atttataatg 

14881 cagaaatggt gttttttttt gaaaaatttg gattgttcat tattatagaa cataaagaaa 

14941 tacacgtttt aggagcaaac atgaattaca cagatctgaa tataagatta aaggcataaa 

15001 tctggggggg tgagatttgt ccatggaaat tgtataattg ttttatgcca ctgttaatga 

15061 ttttaaagcc tgaaaaattt accactttac aataatttta gattgttcaa tatgttccca 

15121 aatgtgagtt gcatcaaatt cggtaaatgt agagtatgtt tgttttctct gtacacaata 

15181 ctgacgtcag tgctagtgcc tgcctaaact cagacaccag tagatccttt tccagaacta 

15241 agtgttatag gaggaatatg gattatactg taatatatca gatgtgaaaa aagcctcacg 

15301 gtcccttcct ttctcaaaaa aaacgtgtag tattttggaa tatgcatgga gtatagcact 

15361 tctaaaaatg gtactttatg tatatatgaa gtaaaatagg ccttaaaact taatatacag 

15421 atttccttga aaataatctt attatacatt tacactgact ttttggtcat ccaaacatta 

15481 tgcacttaag caaaaagaaa tataaccaat tttgcaactt tcctcttcct gcctttgctt 

15541 ctacttaaaa atatctgcag tatttccttt tctcgccgct tccactttgg ttcatatacc 

15601 aaggaatYgt gaggtcaatg ggtgacagtt gtttgtctgt gttcacggct gtcgggcggg 

15661 gctgggtgtg ggtaaggctc tgaccgacat ctgaagaggc ttcacccact gcggtttagg 

15721 atacattttc ctctcgctct gccagtagaa cagggaggct agttttcaat gtgttctaca 

15781 gaaggataat taaggcagtg tgtcacctgc tcgtctgcta tttttaaaag gaaacaaaca 

15841 taacctttgc atgtgagagg cattgcaaac tagatgggag gaggcccgga ccctgtaaca 

15901 gtcgtacatt ccggtcccag cttcgctSgg gtgcaactgc agataaaaga cgctggctcg 

15961 acaaaaagcg cccctgggct ctgcattgcc agcatgcaca atacgaattg ctaatgatga 

16021 gctggcttag gtaggaactg aggaccctga gtgcagggct aacctctgct tcacaggagg 

16081 ggtccactgg cgcgcgctgg aggcggggag ccttcgtgat atttctccga agggaagagt 

16141 ccatgcagca tcctgataac agggaccttg gcttcatccc caactgctcg gtgaccccga 

16201 tgcgtcattt aattcatttg tgccaatatt cctttttcaa tcaaccagta actttgcagg 

16261 tttgttcaaa gaagaaatta aaatgcattg gagatRgaag gcagccatct tcccggagcc 

16321 ccagtgctga cactgggagg ggaaccccag gattccctca tgactgtgtc agtgatggcc 

16381 ctcaacaagc acttactgaa tgagtgaatg aatgaatgaa tgaatgaatg aagccggcca 

16441 cagagcccag gagtatcagg ggagagcaat actggctgLg Lcagtggcta gtccaggaga 

16501 gcccttcaag agcccagagt tcgacttgaa cttgggactc ccctgggatg ggtgaggttc 

16561 tctcaacagc agttgacagg agggaactgg gacaggccca gccagtttca ttcagtccag 

16621 ccagctgtgg ctcctcttta ggaatgtggc caggcccagc tgtggccaaa gcaccagcca 

16681 gtccctcagt tctgaacaga acgtgcccag catcagtgcc ttcatttgtt tttaaactgc 

16741 ctgctgacca tccacaaagt taagatgaca gctgggactt aagttgtgag gaaccagtgg 

16801 ctcccccatc actctctctc acacacacgc acatgcacac acacagagac acacagacac 

16861 acacacatgc acacacacag agacacacaa acacacatgc acacaaagac agacacatat 

16921 aaacacacat atacacacat gcacacacat atatacacac agacacacag gcaaacacag 

16981 acacacaatg gatggtatcg atatctatga acagatgaga gagagagaca gagcaqqagg 

17041 tcatctgctc cccacccctc ttctccccac tgtcaccacc tctctcccaa gtctagctgc 

17101 aggccctgtg agctgtggaa gtggcttttt taagctgtct tcttagattg agtcattggg 

17161 cagtgaggaa ggtaaatttt ctctaagaag cctcccaaat ctgtatctta tgggatttcc 

17221 gtctcaataa ttgagctaat agtacctgta agcaagtgga tagtcaataa aaaaaaaata 

17281 ccggctgtaa gaacgaatga atgaatgagt ggatgagtgg atgagtatac atcctctatt 

17341 ttgctggcat catgattatt ttacagcaga tttatctgga tttcaatata aaataagagc 

17401 tcccatttgt atacaactta gtccttgtaa ttttccaaag taactctgat taaaattatt 

174 61 tccagccaaa ctatttaggc aactgggttg gtttcacacg tacttatcag aaaccattag 

17521 aatggtatat tgattgcatt tctagacatt tgccatacca ccatggcaac tgtatttccc 

17581 aagatggccc agtaggaWgt cccatcatgc ataaccttct ataaagtgac attgactttg 

17641 cttccatcag aaagtgaggt ccacgttccc actccttgaa tctgagttgg gctgtgacta 

17701 cagcagaagt tcagctatgt ggcttccaag gctaccttgg aaaggtgata gagcttccac 

177 61 ctgattttct tgggaacagc cactcttaaa cctcagccac catacgggga gaaagcccaa 

17821 gccatgtgac gaggccattt gtaggtgttc cagtcatcag tcccagccca cagccagcat 

17881 taacccccag aggtataaat gaagaaacca agatcactcc agccccaacc ggcacttgac 

17941 tgcaaaccta taagagactc caagcaaata atatctatct cagcccaatc aagccctgga 

18001 accatgagag attctaaata ataaagtgat cattgtgatt ttaagccact gagtcctgca 

18061 gtgagttgtt atatagcaac cgggcatcgg cataccatca gaagattcct taagaagaaa 

18121 cttttgcaac gttgctacag cagaaaatat tccttctgta tgatcattca tccattcatc 

18181 aatcttgatt ggcattgcct atgtgctaag cattgtcggt cactggaggt gcagaaaaga 

18241 agtgaagcag tattcagcca aacaaacaac ttgcatattg gatagagaga aaaatatgcc 

18301 aaaattacaa cagtattttg acccaacaga aaggacataa gaagggaaca tcaacagttc 

18361 cttcctattc ttttccaccc tttttctttc taggctcatt aaatgtagag gtgcctcagg 

18421 gtcactttgc cccatgagag tagataccct cctctcccat ccctaacttt ggtcttcact 

18481 ctatgcctga taccagcagc tccccaggct cctccatgct gttggcaccc cattcccatt 
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18541 ctctggaaac atctcagccc atcaacctca gattactttc ccaaggagat gtgagaaaac 

18601 ttcaattttc ccctttgtga cttgcagaga aataggggtg gtaaatgaga agcagaggga 

18661 gagggtgggg attcattacc ttgagttact aaagagaaaa gctctagtgt ttagaaggtg 

18721 ctgttattat tttctctgaa acattagctc ctagaaatca gaggctctgt ccctagcatg 

18781 gtctctaagg cttccaggac gccccagggg cttccctcag tcaattttga ctctaatgag 

18841 agccatctgg aaggctgcaa tggtgggcct ggctgatcac tggaggggcc ttcgctgtgg 

18901 gcacagggac gtgcacaggt tcacagagtg tgtctccgtt tctgggaacc atggtcctgt 

18961 gtgctccttc cccaaagaaa gcaggagaaa gaaaaaccac agtcctgagt gggattctcc 

19021 ccaaacatct aaaaagcttc tttaaagaag tcaagttatc ataattaaac tcacatatgg 

19081 aatgcctata gacaacgtgt tgctcttcca atcccaggga gctaggacac ccaaaactca 

19141 cctcccagac ggagctagga tttgagatac tggattccac attacttaaa gctgttcgag 

19201 cttttcatca ggaatggaaa gcattaatgc gtctcacacg tcactctctg tttcatggaa 

19261 ttcttacact gRggagctct ttttatttta ctatttttct ttaacagcca gaacactgag 

19321 aagtttgcaa agaaattttt cagctgcctt tagtgaccct taactcaaga ggttttgcct 

19381 aatctgtaaa ttggagttga aaatatttca gtacttttgg agtggggagc tttttgtctt 

19441 taaaagagtg gaacgacact aacaataggg ctatacatac cttgtgcttc tttgtcagtt 

19501 actgcaaacc aaaaccagat gtgaagtatc agcggctacc tcgtagtgac ccatggaagt 

19561 tgaggctaat ccatatcatt cttaataact caaaactgcc accttttaac cacctaattt 

19621 ttttctcact tacaaatgca aaaaaaggaa ttgacagtac acattgaaat cgattttccc 

19681 ctgaccactt acagattttc tcctataaga gccaggaata aaagctcact ccattatatc 

19741 aagcagccat catctgagta tgctttagtt taggtgatca tcacctttaa caaaccaaga 

19801 tctggaagga agatttctga tctttaacaa accaagattt gaaaactaga tattcctgtc 

19861 atgacctttt cccaacattc agtatgtaag ggattcactg gattattaga ctgcttgtaa 

19921 atgtaagaaa acatagaagg tttagtgtga aggagttagc aacctaacaa atgtttcccc 

19981 ctgaattttc aataaccttc ctgaatttaa aggtaatgaa tataaatgaa gaacaaaatc 

20041 aatagatata ggaatgattc acaacctccc aaatgaataa atcRatcatt ccaccctaca 

20101 tttgaacatt caatcaatat tttgtctatt tactaaatct aagaggaact actttaacat 

20161 aaaactaaat atgaaattat ttccagcatt gaatattgtg ttttgcctta gagcaggcac 

20221 aaagtggaat gcctatgggt acaaagctag gcattgattg atgtttaata cctacaaata 

20281 cagcaattac actgtcatgc tacacgagac ttcctattcc tacactgaaa ttagactgca 

20341 catatgtagg gataaaagat aagacacaaL catatcatat ttcctcaaat ctaagacttt 

20401 gggtggtaag atgtaaagca catcatgatt taaagtatca ctaggaaaga aaaaacaatt 

20461 ctgccaatga aaattcaccg taatactttc taatcaatca attttaagat acatcccaat 

20521 ttttaagcaa gaataaatgt gttaaaatta acacaaattg gtacacttgt gatacaWcag 

20581 aagcactaaa atgtcacagc aatagaccac agggtttctc aaccttggca gcatcaacat 

20641 tttgggttgg ttaattatct gttgtggggg ctgccctgtg catggcagga tgtttgctag 

20701 cattcctggc ctctacccac tagatgccag taacacctcg cagactccaa ttatgacaac 

207 61 caaaactgtc tttagacatg gccaactgtc cctaaaggtg caaccaaaaa ctgtcccagt 

20821 tgagaaccac tggaacagag caattcggtt ttatcagcct tgtaatataa ttgtgtgtgt 

20881 gcactgtgtg catgacagac tgtcagccta atggaaggta tattgtcaga tgggatagat 

20941 tgacctgaaa aaatcacata aaccaacagc tggtacatat acaagagtgt gtcctgccca 

21001 tcctaaccca cacaatgcaa aagaacctgg atgaccacga gtcaccatca caggagttta 

21061 ctatccataa gtgtaaacca tccaagaaaa gtgaaaacag attaaagaaa ataaaaataa 

21121 ataaaccacc acacaatatt tgggcctagc cagataacaa caatgaaaaa atgtaatatt 

21181 ttgaagtatt ggcaagaatg tggaggaaaa agcattgaca ttcgtgtata atgggaataa 

21241 aataatttat cctttgtctc cgcaattgat attctagaaa tctatttttt agaaatactc 

21301 acaatagaca gttataagta caaaggatgt ctgctattac actgctcagt aacagtaaaa 

21361 tatgaaaata cactgtacgt ctttcaatca gatattgaat aattatacta cacagtatta 

21421 agaatgagat agatattaat gtactgcata gaaaatattc aaagtataat aagttttaga 

214 81 aaattatcac atatgtatat tagcatatgc agagaaaaca tcttaaaata caaatttcag 

21541 ttttgaggtt tggatcatgg agtaactgcc cccttttcaa gtcaaacagt tgagaaaatt 

21601 taggataaaa caagtgtgtg ctacctttgt aatcagataa agtattttaa aataattcac 

21661 attataatat agaaatactg aaatgccaag aaaatttaaa tatattaatt aaaacctacg 

21721 ggccaggcgc agtggctcac gcctgtaatc ccagcacttt gggaggccga ggcaggcgga 

217 81 tcatgaggtc aggagattga gaccaccctg ggcaacatgg tgaaaccctg tctctactaa 

21841 aaatacaaaa attagctggg catggtggcg tgtgcctgta atcccagcta ctcaggaggc 

21901 tgaggcagga gaatcgcttg aaccagggag tcggagggtg ccgtgagcca agatcatgcc 

21961 actgcactcg agcctgacaa cag'agtaaga ctccgtctca aaaaaaaaaa taataataat 

22021 aggctaccaa ttagacttta tagaatacat aaatatatta atatagctac aaagaggttg 

22081 aagcttattc attgagacaa gagtatttat tgagcatcga ctttattaaa tacactgtta 

22141 gtaagataac aagtgaaact tgttaaaaaa aaagaaacta ggacaaaaga aatttgagtt 

22201 gagttcccca aaaacaggga atattcagat tgtaggggaa attgggtaaa ggatttgttt 

222 61 cagcatgaaa ggagcaagga gcaaagaaac taaatgggag gttatgctga gaagtatatc 

22321 agtatgagga aatatcttga catctacaga caaatatttg ttgccaattc attttgggca 
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22381 


ttgaggattc 


acaagaataa 


ttaaaaacac 


qtaatataaa 


atccttctqa 


ttaqaqcac 


22441 






aqatgqctqt 


qqqaqtcatt 




tactaacttq 


22501 


aacaaggctt 


qqttctqcaa 


ataaaacaqa 


aqaqtaaata 


tqaaqtqqaq 


cacataat 




tgttactgtt 


tctqtqqatq 


aqcttccata 




aaa catca 
aaaggca ca 


atctcaaaa^ 


22621 




cccaaaqaaa 


qaaqaa aaa 


aataacaaat 


gagaacaaaa 


tt 


22681 


tataaataat 


ataataoaoa 


aaataaacaa 


aaccaaatct 




^tt ^t 
aaaga ag 


22741 


tatatataaa 


tcactoataa 


ttacaa 
tactacao^t 


aaaagaggga 


tt t 
agg aaaa 


caaag c 




aacaataaaa 


aaqaaactat 




gc g agaca 


ggaagga 


aa agaa a 


22861 


tatqatcaat 


ttcctatcaa 


t 










taatttacca 


aaactqtcaa 


aaqaaaaaac 


acaaaacct^ 


t t^ 


a a aaag 


22981 


acattqattg 


aaaaatctca 


qac 


a^^acaactc 


caaaaataaa 




23041 


aaaaaaaaaa 


aqttccaqqc 


caqacq att 


t t 

ca ataaaa^ 


ttctaacat^ 


^^ttt^^^^^ 
ca aaaaa 


23101 


aaaaaaaata 


ccactcttac 


ataa ctttt 




aaaa aaa c 




23161 


aatataacta 


qaaaattatt 


taaaaa 


aattat^^^t 


caatttcata 


^ga accaa 
aa a a 


23221 


gatgcaaaat 


cctaaataaa 




acaa^ c^^ 
aa cca 


ttt 
^^^^^^ ^ 


cgacaa aa 


23281 


atatattgtg 


aacaaac gg 


atttatttta 




c g caa 


caaa aa c 


23341 






atttatataa 


cttta^a^^^ 
aa aa 


aagaagaaaa 


^^^^ ^ 


23401 


acattcaaca 


ttcatattaa 




aaa agaa 


a aaagagga 


ag a aaa 


23461 


tcctacaaat 


tgataaaagg 


aatctac 
ttt*^ ^'^tt^ 


aacc acag 


aaaccacaca 


ca g c 


23521 


aaaactttaa 


aatttaatca 


gaag g 




caaagaagcc 


ccaca a a 


23581 




aaa^^tt^^^ 


gg g 


ttagttagta 


aagaaggcaa 


ggaaa 9^^^ 


23641 


tacaatatta 




aaaaag aaa 


taaaacttta 


ttacagatg 


acatcatttt 


23701 


ttt 








^ 


aatgagtttt 


23761 


ttt 


tagatacaag 


g caa a ag 




a a a ca 


gcataaacat 


23821 


t t 


caaaattttt 


aag ga ac 








23881 


ttt 






gaacctcLac 




acaaaat'^tt 
acaaaa a 


23941 


attaaaaaaa 


gaccaaaata 


aatqqataqa 


tatattcatg 


att'^aaaaa 




24001 


aaaaataoaa 


ttttcttcaa 


attqatcaat 




ca at'^catt 
caga .g 


caa a ca 




acaaaqtttt 


''ctcttaaa'' 


a t attct 
y g y 


aaaatat^'ta 
aaaa a g a 


ggaaa gca 


gag caaaaa 


24121 














24181 


actggaaaac 


eta aq a 


t^^atacatt 

taaaca^acc 


tct^aacaca 
c gaacaca 


tacaacct^^ 
aataa'^att'^ 


^ tt*^*^ 
caaga gaa 


24241 


tcaaaaaaaa 


actqaaaact 




aa aacaag 






24301 


aaaatctccc 


aacaaaqaaa 


c 




cactactaat 
ac ac aa 


c accaaac 


24361 


ttaaaaaaaa 


qaactaatac 


caatcctcac 


caa^ccattt 




aagaggagag 


24421 


aattcttcct 


aattcattct 




cattaccctc 


ataa a t 


cagacaagaa 


24481 


cacaacaaaa 


aaqaaaacta 


caqqctqata 




acatcacttc 




24541 


qaatacaqac 


caatatttct 


qataaatata 

gg g 


aqttttct 

g ag 


tattttgttc 


t 

agcaaaagc 




atcaacaaaa 


tattaqta c 




cagcaca ca 


aaaagataat 


gcacca gg 


24661 




t t 






cacaaa c 


aa aaa a g 


24721 


atacatcaca 


tcaataaaa 


aaaacaaaaa 


t 


a a caa aa 


a gcagaaaa 


24781 




aaaacttaac 


attgtttcat 


ag aaaagc 


caaac 




24841 


aaaaacatac 


ctcaatatat 




aca ga aa 


cccacagc a 


aca a ac 


24901 


gaacggggaa 




tctctcctct 




acaagacaag 


ga c ca 


24961 


tttatcactt 


ctattcaata 


t 


t tt 




ggcaagagaa 


25021 




gtcatccaaa 


ctqqaaaaqa 


aqtcaaattq 


tccctcttt^ 


gaga gaca 


25081 


aatcttacat 


eta aaaa 


ctaca actc 


caccaaaaaa 


ctctt tt 




25141 


attcaqtaaa 


qttqcaqqat 


acaaaaaaat 


ca ta catt 


tatataccca 


ataat^cact 


25201 


qqctqaqaaa 


qaaatcaaqa 


a aaaatccc 


atttacaata 




aaaaatatct 




aqqaaaaaaa 


ttaaccaaaq 


aaqc 


tctctacaaa 


aaaactacaa 


aacactaat 
aacac aa g 


25321 


aaagaaatgg 


aaaaqqacac 


aaacaaat^a 


aaa acact 


cat teat 


ga cagaaga 


25381 


actaatatca 


ttaaaatqac 


cacaccaccc 


aaataaatct 


aca^attcaa 




25441 




taacqtcatt 


tttcacaa a 


ata aaaaaa 


acaatcctaa 


aatttatat 


25501 


aaagcaaaaa 


ggagcctgaa 


tagctaaagg 


aaccctgaac 


aaaaacaaca 


aaactgggag 


25561 


catcacataa 


cctgacttca 


aaatatatta 


caaggctcta 


gtaaccaaaa 


gagcatgata 


25621 


ctagtataaa 


aacagacgca 


tagaccaatg 


gaacagaatt 


gataacccag 


aaataaatcc 


25681 


acatattcac 


agccaaccga 


ctttcaacaa 


aggcatcagt 


aacatacatt 


gggaaaaaga 


25741 


caccctcttc 


aataaatggt 


gctggaataa 


ttggatatcc 


atatgcgaaa 


gaataaaact 


25801 


ggacctctat 


ctcttgccat 


atataaaaat 


caactcaagg 


tagatgaaag 


acttaaactt 


25861 


aagaccccaa 


attataaaac 


tattaaaaga 


aaatagagaa 


aacatttcag 


gacattgatc 


25921 


taagcaaaga 


tttgatggct 


aagacctcaa 


aagcacaggc 


aagagaaaac 


agacaaatga 


25981 


gactatatta 


aactaaaaca 


cttctgcaca 


gcaaaggaaa 


gaaaacaacc 


agcagagtga 


26041 


agagacaacc 


tgttgaatgg 


gataaaatat 


ttgtaaactg 


ttcatctagt 


aagagaaaat 


26101 


ccagaatata 


cacaaaactc 


aaacaactca 




agaaaacttt 


tacaaagtgg 


26161 


gcaaagaacg 


tcaatagaca 


tttctcaaaa 


gaagacacag 


gaatggccat 


caagcatatg 
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26221 


aaaaaaaata 


ctcaatatca 


ctaatcatca 


qqqaaatqca 


aatcaaaacc 


acaatqaqat 


26281 


atcatcatac 


cttaqttaqa 


atqqcaatta 


ttaaaaaqac 


aaaaaataac 


aqatqctqqc 


26341 


caqaatqcaq 


aqaaaaqqqa 


actcttctac 


actqtqaqtq 


qqaatqtaaa 


ttaqtacaqc 


26401 


cactataqaa 


aacaqtaqaq 


atttctcaaa 


aaactaaaaa 


taaqactcaa 


tataatccaq 


26461 


atatctcact 


actqqqtatt 


tatctaatqa 


aaaaqaaatc 


aqtatatcaa 


aaaqatactt 






atttaccqca 






gaca gaaa 


a t 


26581 


qtccatccac 


aqatqaaatq 


qtaaaca 
t t t ^^t 


tqtaqtacat 
g gg 






26641 


tqaccataaa 


aataaa aa 










26701 


tatqtcaaqt 


aaaataaqcc 


aaqcacaaaa 


qacaaatacc 


acatqttctc 


actcctatat 


26761 


qqqaqctaaa 


aatqttaatq 


tcataqaqat 


aqaqaqtqqa 


atqataatta 

g gg 


ccaaaqoctq 

g gg g 




aqcaqaqaqt 


gggagtggag 


aqqaa a 
t ?^ ? 


ttt ataat 


ggg acaaac 


a acag ag 


26881 


ataqaaqqaa 






t 

cagag age 






26941 


tattqtataq 


ttcaaatcaq 


ctaqaaaa a 




t^ttcct^ac 


acata aaat 


27001 


qaqaaatacc 


caaqqtqaca 


acaccccaa 


aaccct act 


t^atcattac 


acattctat 


27061 


catataacaa 


actatcacat 


qtaccccat 
t t t t 


actat t 


ttt^t^ t t 


t*^? t'^t^t^ 


27121 


tctatctatc 


tatctat ta 


c a c a c 


a c a c a 






27181 


aa caaatc 


ttaaaaaaaa 




gg aacc g 


ctctactaaa 




27241 


aaaacacaac 


aattaqqtca 


g g g 
gaa 


ca a 


gaaacaaa c 




27301 


aaa aaa a 


aagttcagaa 






ac ga 


gacga 


27361 


t t 

gg gg g 


tgattttttt 


acaaaggg 


t 


aaaaaaggg 


agaaac g 


27421 




gctggaaaat 


gagaaa cca 


catgagaaac 


agaaac ga 


gaag aac c 


27481 


ataacatcac 






ctataaatct 




agcaaaaac 


27541 


atqaaqtttc 


tqqaacataa 


t t 


tatattcctg 






27601 


taaaaacaaa 


qcacaqaaac 


actaagagta 


aaggaaagac 


taataaattq 


aactqcatta 


27661 


aaataaqaaa 


aaaattaqqa 


acttaaattt 




acactattaa 


a aatcaaa 


27721 


aqqcaataca 


tqqaqtqqaa 




ttaacacatc 


taatctacta 


ttcatatcca 


27781 


qaatatqtaa 


aqaaccccca 


tcaatcaata 


g a c 


caa tca'^aa 


tacacacaa^ 


27841 


qaaaaacaqq 


qaaqaaactq 


cacaaaca 




atatccaac 


t^ctcattaa 


27901 


aaqqtrttct 


acttcattao 


aaacaaqa 


agtacaaatt 


aaaacct aa 


t^aaataaca 


27961 


ccacaqatac 


atcaaaat t 


ctaaaacaaa 


qactaacaat 


gttggugaag 


a g gqaqca 


28021 


ataaqaactc 


tcacacacaq 


ctqqaqttta 


Raaaacatcc 




aucagggctg 


28081 


taataaqaqq 


catataqcat 


attcaaataq 


qtaaqt aac 


aaaatttaat 


gaaagaattt 


28141 


acaacqqtaa 


aqqtaqaatt 


taqtaaaaac 


caaaaaataa 


aqaa tcccc 
gg g ccc 


caggc agca 


28201 


a aaca aa 


ccattacta 




taca 

aaacaata*^^ 


agggagg ag 


tt^cat'^^'^'* 


28261 


qcaccqqtaq 


ctatqaacac 


qq 

gg gagga 


gg 


g g gg 


gg g 


28321 


qqaaaaaatc 


actqccaacc 






ggagaaaaa 






actctctcct 


cctactctca 


ggcttccttg 








28441 


aaataaaaaa 




catttgatgt 


at 


t t 
aaao^aaatc 


cc ggacaag 


28501 


aqqaqcagaq 


aaaqatqqaa 




aaaaqtaaac 






28561 


ctqtcatcaa 


aaactatata 


attccactat 


ttaat 

^^^^ 


tacaatagag 


t t t t t 


28621 


atgtgcacca 


gaagtcacct 


acaaaaatqt 




attattctct 


to ^ ^ 


28681 


ctatatctgg 


ctgcacaact 


aqattaqaac 


taaq cct 












gccctttgcc 


atctqatatq 


cttccaaaca 


aqqqaaartr 


28801 


aqcttqqaca 


ctaaaaqcat 


tttgtataaa 


atact ttt 


tat atatcc 


ccat tcacc 


28861 


acacacacac 


acatacaqac 




acacacacac 


acaacaaaaa 


caaaaaaaac 


28921 


ttccaqqqtt 




tcaqttaqca 


tcaaaact 


tttcac t t 


aaatacttt 


28981 


tgcaaggagg 


cagatgaaac 


gggggagaag 


qqataacctt 




catctatcaa 


29041 


ttcactgcaa 


ggaaggaaca 


cagagcttag 


atat aqca 


tcaqqaqttc 


ttcccaa ac 










rggaggggar 


qqccttcact 


qaqaaqtqqc 


29161 


atcaaattag 


aacctgcaca 


acaaataqqa 




taaaqatcca 


qaqaaqtaqc 


29221 


cctctagact 


gaaggatcag 


ctcctcaaat 


atcctttttq 




cacaaqttca 


29281 






acaagccagg 


qctqaacaat 


ctcatctgtg 


tccatqaqqt 


29341 


gggcaagagc 


cagatcctac 


tgaggagttt 


gaacttaatc 


ccaaacatgg 


cgggaagcca 


29401 


ttggggccaa 


agatgaatta 


gggattgtct 


gatctccttg 


acacgttctg 


aaggttttgt 


29461 


gtacaRaaaa 


ggtttgtcaa 


catgcacaaa 


ggaaagctgg 


gataaccaag 


ctaagacgct 


29521 


accgcatcaY 


tctaggggac 


gggtgactca 


ttccaggagt 


ttctgcagga 


agacctatca 


29581 


ggactggtga 


ttgactggag 


gcagagagca 


cgaaaaaaga 


acaatcagag 


atgatctgtg 


29641 


gtttgtggtt 


gttgtgtaat 


gttaaggctt 


aacaactgcg 


aaaatagtgt 


ttttacttta 


29701 


cagagaagaa 


gacttgaaga 


atacaagatt 


ggggagagca 


gatcgggatt 


tgtactttat 


29761 


atcagacgtg 


ctcatagata 


tacatgttga 


gatatcagtt 


gggagctgga 


tatatcagga 


29821 


ttcgagactg 


ggaatcacca 


gtttatagat 


ggtttttaag 


ccatgtgatt 


ggatctcatt 


29881 


tcttagtggg 


gagaagtgag 


ataacaaata 


aaagccaaga 


ttaaagctga 


accttcagcc 


29941 


tgcccatcac 


ttagaagccc 


agcagggaca 


gaagagccag 


ccgacaRcac 


agagtcgagg 


30001 


aggaggattt 


cYgggcttca 


gtgagtgtgc 


atcagagatt 


catggacaca 


acttaaagta 
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30061 


agaccagcca 


cacagttccc 


ttcattttct 


caagcaacat 


tcagcatctc 


aaagtgcggc 


30121 


tactacacac 


ggagccaagc 


cctggtccag 


ttatcacaaa 


accttactag 


atatgtcgat 


30181 


tattagctgc 


attttatagc 


taaaggtagg 


caggcacaga 


gaagttacgt 


ctcttgccca 


30241 


cagtcacagc 


aagcaaacag 


acctcagacc 


aaagtctttc 


taactcctta 


tttcacatct 


30301 


aaaacactat 


gtgacgccac 


cKccaaaaaa 


tgtgggggtt 


actctaggac 


aaaagaagct 


30361 


catgaaagca 


aagagagaag 


catagagctt 


aaatttagaa 


caatattcag 


ggtgattggg 


30421 


aaaaatagcc 


tccatacaaa 


tataatcggt 


tcatcaccag 


gtgaataaaa 


tgagagagga 


30481 


acctggtaca 


agtgaagttc 


attgaatttg 


cttgcaaaaa 


aacataattc 


aagtgtggaa 


30541 


gtttggaaac 


taaaatagta 


atcagctaaa 


gaaattttca 


ggccaactta 


gacacaatgg 


30601 


tgacaaagct 


cccctgagga 


gacaggaact 


atactacttg 


caactctaac 


tgaatggact 


30661 


taacttagca 


attaagatca 


ccgtaagttt 


gatcagataa 


gtattaaagg 


gtgtcttggg 


30721 


tacatttgca 


gattcagaca 


ctcttagttg 


tgaaattctt 


caccaaaaat 


ttgcaactta 


30781 


tggcttgcac 


attaaaaaca 


tgatatgaat 


gaaattcaga 


tatctcttgg 


taaattttat 


30841 


aattattcct 


cccatcagct 


aaagttctca 


gtatgaaggg 


ttataactga 


tgtcatatac 


30901 


aataactaca 


agctgtatca 


tacaaaaatt 


gagaattttc 


tattatagaa 


agagttgctt 


30961 


taatctaaac 


ctaaagtagc 


atcaataatt 


tttcgtaagt 


ggtccccttc 


ctatgagtaa 


31021 


gcccccagga 


aaaataaatt 


gcttttaaga 


aatgctgtaa 


gtaagctgtt 


tcatatataa 


31081 


gttttgacaa 


gatctggcta 


cttaaattac 


aagatataaa 


atgagatttg 


tactttttct 


31141 


caaatgtaaa 


aaaaaagatt 


tacttctgaa 


ataatggtct 


ttataaaata 


tgcatttttg 


31201 


ttcctttttc 


ttatcatatc 


tataaattta 


ctaataaaag 


attctatctt 


gatttactaa 


31261 


catttgaaaa 


gattgattat 


attggtattg 


caagaagagg 


ataccataaa 


ttaaaaatcc 


31321 


atatattaac 


catattcttt 


tatttgcaat 


atgacaaatg 


attgtgtaaa 


gttataatca 


31381 


ataaaagcta 


tttgaaagtc 


ataaaatgga 


tattgccata 


actgctttcc 


tagcaaggtt 


31441 


aaaacacttt 


aactattttt 


aatcaactta 


cagtgtagta 


gatcaagcat 


atctaaatat 


31501 


tttattttaa 


tcaagcaaat 


atatttctgc 


tgttcctatg 


acagtatgaa 


agtttggttc 


31561 


aaaatatttc 


caaggtattt 


aacacagttc 


agtagaggtc 


ctgtctcata 


aatcaatttc 


31621 


agattagata 


aaaggaaaaa 


gcaaattctt 


gtttacttca 


gaatttttca 


agccagttcc 


31681 


aaagacagag 


ttgttactct 


catgtgaatt 


ttcagtagaa 


agctgaacaa 


cgacgttgct 


31741 


ctgacaatct 


ttttaaatac 


catatgatgg 


agacagaata 


acaaaaacaa 


agtaggtata 


31801 


agctgtacga 


aattatttct 


cttggacata 


aaatatgttt 


ggggaagaaa 


tgcattgttc 


31861 


catatggact 


agggtagtta 


cttctcaaag 


tgggagcaac 


tcagactctg 


aaaaatggtt 


31921 


tctcaaagga 


ggcattgtgt 


ctgggtgagg 


ccagttttct 


agtcaaaatc 


attttattaa 


31981 


atgccattgg 


attgaatgct 


tccaggtttg 


ctctctgaaa 


tatgccctat 


cattcttaaa 


32041 


aaacacactt 


aggtagcaaa 


aagggcacac 


taaaaataac 


gtgggttaat 


gtttctatcc 


32101 


tgcctgtgtt 


ttctaagtag 


cccctgcctg 


agctttcgac 


ttttttaaat 


gaaccatcaa 


32161 


atagttggtt 


gtctatcaca 


aataagccat 


ctattttgtt 


attcctgtaa 


cttttYgtag 


32221 


aatccagaaa 


gtaacattct 


tgctgatata 


tcttatataa 


caccaagtcc 


aLtttccaaa 


32281 


tgaggattcc 


aatgagtatt 


catcaccaac 


atctccccga 


aaaatattta 


gaaaaaacag 


32341 


actcaatcct 


ttatccaaag 


tcagtgagta 


agaggcgggg 


gccagggttg 


gacaaaaatc 


32401 


tctttggcca 


ggaagccggt 


ggtcaccctg 


atctacttct 


ctaagctccc 


acccctgatg 


32461 


aatgttcaat 


atctgagttt 


caccccactc 


attattgtaa 


ttctccgtaa 


aacccYagtt 


32521 


acccacacgg 


aattctcatc 


tacRtctagc 


ccaatttgct 


actggatttc 


aatgttattt 


32581 


catatgaaaa 


caatcactaa 


cagaaagaac 


ctggatctgg 


tttcgaaaac 


ataatttgaa 


32641 


aggaattcat 


tctgaaggct 


gcaaaagccc 


agtgatggcc 


catcggtacc 


ctgaatgggc 


32701 


ttctcaggtg 


cctccactct 


cctccgtatg 


tctccatcca 


tctcaRaata 


catgtctggc 


32761 


tcttcctgct 


cataaacaga 


aacagagagt 


tccgggatcc 


aaaggcaaga 


ctgcgattta 


32821 


agagacactt 


tgtttttgta 


ggtgttcaag 


gatttggaat 


caaagcagca 


gaattattaa 


32881 


aattaatcca 


attttcttgc 


atttattttg 


ttagttcata 


cctctaaaag 


atatattttg 


32941 


gaaggcaccc 


ttcaacccac 


ccatttacaa 


atcaaagctt 


tctttggaat 


ccctgagacc 


33001 


aaacccaagc 


tagacagtga 


cacactgcca 


gctccagagc 


cctgcccacc 


tggaaggggt 


33061 


atggaatgca 


ctatttgact 


tgagctcaca 


tttaattaca 


gtgtcaaaaa 


tcaaacttaa 


33121 


attgctttaa 


tcagctKcca 


acactatgag 


cctttctggc 


ttgaatttct 


ataaacaaag 


33181 


aataaattca 


aatctgtcac 


aaagtggcag 


caacttgttg 


accactggac 


taggaatcgt 


33241 


tatgcaacca 


tggctcactg 


tggaaaaagt 


caacatcttc 


ctatatgtga 


ctcttttctg 


33301 


atcttttcgg 






tcataaagct 


aaattatcta 


tctgcactga 


33361 


aattactcag 


cctcagacat 


ccaaaccagt 


agctgttcac 


caatgaaacc 


tgaataaagc 


33421 


tgttgctgac 


tttccctgct 




gtatttttgg 




gcatgtttaa 


33481 


aatataataa 


gcaaaaggac 


ctttggtaat 


taaacattct 


gtaatttaaa 


actccaaRag 


33541 


acaggagcag 


tgataagata 


ttagattttc 


tttactgggg 


tgaaaaagac 


agaaagtggt 


33601 


taactgttta 


aaaaaagaaa 


gaaagaaaga 


aatccaagga 


tttttagagc 


cagaaacagt 


33661 


aacagtgtcc 


tttattctgg 


aaatgtttaa 


aagcctgtaa 


gcagagatca 


cccagttcat 


33721 


cctgagcatt 


ggtgagcaaa 


taacttctga 


aatgctcttg 


aaagagaatt 


ctccagcctt 


33781 


ctctgtttga 


aatacttYct 


aMagcaaaca 


caaaagtggg 


caaaacacaa 


gagctcatgt 


33841 


taattccaaa 


caaagagacc 


aacagtcgca 


aattgagagt 


ccatcttaga 


agcgggtgct 
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33901 gcttccacat caagcaacta ttggaaagag taggggtaga tgcagaatta agatcatttg 

33961 tgaYtacaga aagagacaag aaaattaatt tttaaaacta aaagagaaat ttatttcaga 

34021 taatccaagg aaatgagcag agtcaaccta acaggattct caagcccctc cctactggtg 

34081 tggtgtgaag atcacacgtg gtgtagctga cacccctcac atcttccccg cRcccctcct 

34141 gccacctctt tttggcattc accctctctc gggttctgta tccctgccca tgttctccat 

34201 gaaaggccaY gttaatgccc atgctctttc ctgcctctaa cagacgatgc cttccacagt 

34261 cactcaagct ttaaccctct tctggattcg gacctgtaca tccaaaagcc tgccagRcat 

34321 ctcttccaga tgctctcaga gcacatgacc agaagcacgc gcaaagctct cctgagcctc 

34381 agtgctttcc ctccaacact tcttccactc tctcagttac cgagccaaga cacagagttg 

34441 cttgtgagtc atttctctac ctcatccccc acccaacctt cctgcgagat cctgctcaYg 

34501 ttacctccta aatgctgtta cgtgacttca accacgtcag ctttttatgg gggaggagag 

34561 gcacctgcat atctgtggga gcaccaacct ctagacatct cctgccaagt tactgcaatg 

34621 tgcagaaaag aggaagaaag caagcccaag tcgcaaagtc tttgccagtc cttctaccaa 

34681 ccacaccttc ggtggagatg tgggtctgtg gaaatgcaga gacggactga aagccaaatg 

34741 ggccaatgag agMtctgaac aaaacattaa gcttgttttt ctttaaggca atacgattaa 

34801 cagtggtcca ttaagaatcc gtttaaaaaa aataataata atacYttttc tgactattgt 

34861 tggggaagaa taactccttt atggggagta tgaactcaaa gacatgatat tgtaaaggag 

34921 gccttgaaga agagagggac taagaaatgt gatggcgtgg tctgttacat gtgcaggtgc 

34981 tgtgtgtgag ggaagcactg aggaccactg gaggatgagg gatgctctgt ccttctcggg 

35041 tatggctgaa ggacagggat tgcccatcaa ggagtaaata atccatgagt ggactaaatt 

35101 ctttagaaac aggaactcaa atccagtagg gattaggata aaaatgtaaa ctctaatttg 

35161 cacctctccc tctctcccct aagtagtccg tttctataat tttatactaa atcaattaaa 

35221 tgtatctgct ataatattaa tcataatttt gtctgaagta gtaacaactg gaaactacct 

35281 aaaagagtca gagcaggact gtaagactat attgtagtct acactatgtc acagYggact 

35341 gcatcatgat agagaacgat actcatgatt taactgataa acacatatgg caatgaataa 

35401 aactgtgagt aagatgtctt taattttagc aaaattatct tcattctaat tttcaagtaa 

354 61 acttaaagat ttatatataa atgcacataa tttaattaaa atacatgcat atattaacta 

35521 gaaggctaat cccccaaaat gtccatagat gttatttctg agtgaggagg gtcaggaagg 

35581 ttttaatttt ctttgtacta actctattta ctaaaatgtc cacctggatt aacgttataa 

35641 taaaaatgtt accgtttaaa ataaaatgaa taaaatacac cggtatgaag gtattattag 

35701 aacatacgaa agtacaaaat caagcagcaa aaggaggaaa ttatattaag aactgaaaaa 

35761 actatgttga gaattctgtc taagaaacgc tgtagcaatt aagaaaaatc ccagttctgc 

35821 aagtaatggg agaaagatga tgacttccag aactttcacc atctactaag atgttcacca 

35881 attcctaaga agggactgaa tacgtctgca tcctcagatc tgagagtaat tttccacgac 

35941 gcatactgag caccataatg agtgaagtct ctgagatgac ctctctgacc tctctaaaac 

36001 aatcaggcac tgcaaacatg gcctatccca cactagccct tggtgaaatt caagggcata 

36061 atattagact gaaacagtga gaagaagatt cgacatggag tgcggcaaga atgtgttcta 

36121 gtccttgaag gccagcgcta ttccaagtgt ggtctcagac cagcagcctc agctgcagct 

36181 gggggcttgg aaatggaaac gtatgtgcta cgtcctggac ctgtggaatc agaatctctg 

36241 aggtggacca ggaatctgtg tttttaaaac cctgtggttg gtaacttcct gctggctgag 

36301 gacagagaga atgcttagtg aacgctgagg aatgacgggt taaataatcc ctgaatgatt 

36361 cctgagcgga tcaatttctc agccagagtt ttgaccgcag ctgatgatga acagggctgt 

36421 gaYcaccagg aagggtctga agaggaggcR ttgattgcag ttttctgcct ctgatcctca 

36481 Kgcatggagc cccacccccc aactctctta cacggaggcc aaagaaaaag ctggaattct 

36541 ccaggagagc aggacctgtt aaacagcaaa gggcaaccca aaattgtatg agtggccttg 

36601 ccctgagaag ctttgaagcc agaggttact gaacaactac cctggttaaa accctaacgt 

36661 gacaatgata acatgtgact gtatgtggct gtcttcaaaa cagcaatgat ttaggcacaa 

36721 ctgtggatct cagattgttc cgtgagtttg gtctgataag attgtatgcg tttcccaggg 

36781 gtgccataac aaagtgccac accctgggcg gcttccgcca ccgaaatgta ttttctcaca 

36841 gctctgaggt cgaaagtcca agatcaaggt gtcagcaggg ctggttcctt ttgaggctgt 

36901 gagggaggat ctgttctagg ctctctctcc cccagcttct gggggctgct gcagtctttg 

36961 gtgttccttg gcttatagaa gcatcatccc aatctctgct tttgcgttca catggccttc 

37021 tcctcatata cctgtctgtc tccaaatttc ctcttctcat aaggacacca agcactggat 

37081 taggggccca ccctactaca tctgcaataa ccctatttcc aaataaggtc acattttgag 

37141 ggaccagcta aatacaaaac attagccagg catggtggtg ggcacctgta gttccagcta 

37201 ctcaggagaa tcacttgaac ctggattaga acttcacaat tactgacacg attcaaccca 

37261 tattaggttg tttagtatta tacattagga agccatagca aacttttttt tttttttttt 

37321 tttttttttt tttttgagac ggagtctcgc tctgtcgccc aggctggagt gcagtggcgg 

37381 gatctcggct cactgcaagc tccgcctccc gggttcacgc cattctcctg cctcagcctc 

37441 ccaagtagct gggactacag gcacccgcca ctacgcccgg ctaatttttt tgtattttta 

37501 gtagagacgg ggtttcacct tgttagccag gatggtctcg atctcctgac ctcatgatcc 

37561 acccgcctcg gcctcccaaa gtgctgggat tacaggcgtg agccaccgcg cccggcccat 

37621 agcaaacttt tgtaaacata actatttatt aaaatttggt ataattaata tttaaatatc 

37681 ctttctggca agatttagcc agttttttaa gcttcattga gagaaccaca atattagcat 
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37741 taataataga atggtctatt aattttttaa aaagatacta tgtctgcatt tccaatataa 

37801 aattcacagc aatgattcag gtagaaatta aatgaggatg tcattgtcaa ccttttaaat 

37861 tatattacag gagcagggag cacacgggcc ctgaggtaga aatgaataag gaatattcta 

37 921 gaaccaccaa gaagccagtg tggccacagc agagtgagcc agtggtgctg gaggcaggtg 

37 981 agatcttgaa gggccttggc aacatgaaca ggagcttgga ttttatccac attgccgagg 

38041 gaagtatgag gtggtgttta gcagggcata aatctcatct gatctgtact ttgaggaaaa 

38101 cactctgact tctgtgtgga gaacagtcta taggaagcca agaggagagg cagggagatg 

38161 agggaggggt gacttcagtc tccaggagaa tgagctagtg atgtgaatta aggtgctggt 

38221 ggtggaggaa gtgaggtgga gaactggaca ctgggatcta ttttgaagag ggagcctaca 

38281 gaacttgttg gtggatggct atgagggaaa gaggattaaa gaatgatgct aggcatttgg 

38341 gcttaaacaa acccctgagt aaatggttat cctattatta atactaagat gggtaatgct 

38401 agggaaggag tagccttggg gagcagtgat gagttatttt cttagaSaga agccacgtta 

384 61 cctataggag gctcaacaga gtatcaaatt agaataaaat atctaatgtt ctgatttgtg 

38521 tctttatttt ttccattaag atacacatgg ctggccaggc gcggtggctc atgcctgtaa 

38581 tcccagcact ttgggaggct gaggcaggcc gatcacctga ggtcaggagt ttgagaccag 

38641 cctggccaac gtggtgttac cccgtctgta ttaaaaatac aaaatattag ccaggcatgg 

38701 tggtgggcac ctgtagttcc agctactcag gaggctgagg taggagaatc acttgaaccc 

38761 aggaggcaga agttgcagtg aacYgagatc acgccattgc actccagcct gggtgacaag 

38821 agcgaaactc cgtctcaaaa ataaaaataa acaaaaacca tgggaaagaa caattgagga 

38881 aactggggac ttgtcctgga acagaagaca cgaggacttt tggtttttta gttttgtttt 

38941 tgtttttgag acagagtctt gctctgttgc ctgagctgga gtgcaatggt gcgaactcag 

39001 ctcactgcaa cctccgcctc ctgtattcaa gtgattctcc cacctttgcc tcctgagtag 

39061 ctgagactat aggcgcacac caccatgccc agctaatttt tgcattttta gtagagagag 

39121 ggtttcatcg tgttggccag gctgtcctca aactcctgac ctcaagtgat ccacccgcct 

39181 cggcctccca aagtgccggg attgcagaag tgagccactg cgtccagcaa gaatgaggtt 

39241 tgaagggagg catgattgtg ttctcttcaa atagttgaaa ggttccattt gaaaaaggga 

39301 tttacttatt cctatggttc tagtaagttg cagagggtgg atctaagacc attgggtgaa 

39361 ctttacagag aggccagaaa gccttcWcca ggggaagaaa gtcaagaaaa ggtctcaaaa 

39421 ctgtctggca gaattctgca caggagagtc ctcaattYgg tgtagaattg gcaagtataa 

39481 taactgtaaa tccctctcaa tctaccatct acatcattag gcgacatctt catttcctga 

39541 aaattccaac acatgaaggc tcacttacta tcaagatgag caagtttgtg ctacatagtg 

39601 aatttgattt caatagqact cattttatac cttttaagag tctttaaaac gttgaaaatg 

39661 taaagatctg caagagcaac tgatcttcaa agtcccgtcg agtcacagga ttttgtaatg 

39721 atgtctaaag aggaaaattc ccatttcaga aaaggctcaa tagtatttaa tagttagctc 

3 9781 aaaaggacga ataaagttct acatgggaag ctgctgaact aSagtcaggc caaaccttca 
39841 tactcggaga agatgactgg gcctgtagac acagatattg atgataaatc atcacaaacc 
39901 acagctttcc ttaattgggg tggtgggcag ggagagcagg cttcaggttg gccttcctcc 
39961 agcaagggca cacacacaaa ggggacctga acacgtggag cttatggtca ggaagcctga 
40021 agtgcctgac ccagcacatc tggtgactag tagtgtagat tccccacaga aggaatgaca 

4 0081 aggcgtcatt attcaaagtg accttccctc cattggaata atcagaaaca gctcacaaat 
40141 aaagtaatct gtaagatgct tcatcttctc taggtaacct ctccccaaca tgatgacatc 
4 0201 acttcacatg aggcgacatg gttttgtgag gacttttctt tccttccaag tttggaatct 
40261 ggctgaaggc taaaatcacc atttacagtg tttgagccct ccttSacact ttgaagaagg 
40321 gtctgtgtaa atagccattc ctacttctca gactcgccag aggcaaaacc tgttttgctt 
40381 tggtagtaca cgctcttcct taactggtaa aagactttaa aaaaaaataa gatgccattt 
40441 ttaattgttt cataaatctg gcttagatat attaaggaaa ttgctatttt atatgtagaa 
40501 cagcggcaag caaatctcaa gagattaaaa atcatgttct tgtggttttc ttttagaagc 
40561 tgcaaatttg aattcaattt aaaaaatcat ggcacaggga caaaataaga atgaacaaag 
40621 atgggaaaaa tgtccgcagc attctgcaga gtcaagcgct gctgtgagtg agtgtgcatg 
40681 tttccaatta cacatcaggg cttggcgagc tccaggagtc aaattcagac agaacccaag 
4 0741 tgcagtgcgt tacacagaac ccaggccagg aaaagcttca cagagtcttt ctttcaactg 
40801 aggcaattgt gtagacaatt tggttcattt caagaaaaaa ataaaactga cagaagaaag 
40861 aaaaatatYa ttatattttg cctctcaaaa caaagtcaaa ttgtgttggt gtataaatca 
40921 agattYagta tttccttggc tgcttccaga ataatagtag atcacaaaga gaataagaac 
40981 aagaaagttc agttctcagc cagacgatcY aaggtaacaa agggtgagac cctcattccg 
41041 ttactcacag gagcaaacag acaaccaata ataattgggc acatgggggt ttcaacaaga 
41101 taaactgtta tgtgagtcca gggtgctaat ttcYggattc ttttgagttg caaggaaaat 
41161 atataccgat tccccacccc cacccaccca cataacaaga cacaatagtt aacagctgga 
41221 agactgtggt tagggggaaa tgtgattgta ggcttccatt tcaaatgatg gaaaatcatt 
41281 tctgaagtca ctacgaaagt tacttcacaa aactaaaaga agccttgaca ccttgtgtta 
41341 tcccatcatc taaagcaatt ctagtttgga aacacttgat tcttggcagg ggattctaaa 
41401 gaaggtgccg tatgtgaaag gctggtgtgt cagaaggagc tttaaggaac tggaatgaga 
414 61 acagccaaaa gctaagattt ctgattataa tcacaggtct ggatgtgttc aaggtaaaaa 
41521 gagatcatcc ccatcatttg caatattttc aagagaatgg ggctttagtg acactgatca 
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41581 


aagtctgaca 


tggattttca 


gctgccttcc 


aactccacat 


acctqcaatt 


caat atctc 


41641 


tatggatagt 


agaagtaact 


gccaccctga 


tgttctcatc 


ggtgatttat 


t 


41701 




aaatacaacg 






tgtccattga 


cccaacaaqc 




ttcccaatgc 




ctcaaccqat 


tttagagaca 




aat atctac 


41821 


ttataataaa 


ccatctqaqq 


qcactaactc 


tttttcacct 


ctgtgttcct 


tgaagcagga 


41881 


aatcaataaa 


atcactatta 


aattaaaata 






tcagatttct 


41941 


aaattaaaaa 


acacacaaca 


tacaatactt 


tatttacaca 


agaRttatta 


gaacccaggg 


42001 


ctaacaccca 


aatqaqgtca 


caaataaatq 


aaaatctgtc 


tcagttattt 


t^ttcttac 




caacacacaa 


acaqaqcaaa 


qcaccaocta 


acctctqatq 






42121 


ataaaattcc 


aaaaqqcatc 


tcagtcattc 


tcttat taa 


atacaaa*^ac 


g caac gg 


42181 


ttaaaaattc 




attttagctt 


ctttatatca 


aact 

caaaga 


agcc caca 


42241 


aaatoatcaa 




aaaat^'aa^^ 


tttt'^aa^at 


ctt^ccaat 




42301 


actttttcct 


tcaatcctct 




attca^ttta 




^^aac*^tt^t 


42361 


qccttacaqc 


tataatattt 


tttaaaaqqc 




aattaatata 


ttt^'^cca'' 
ggccaga 


42421 


caactgtatt 


aaaattcttq 


^^^^^.^^^^ 


tqttataata 


g uagcctgtg 




42481 


tcttcttcaa 


aaatat tac 


a ttatt ct 


tctct tt 

gc 


ctggggtata 


ca g g 


42541 




tqtacatt^c 


aatacctcat 




ttt^^^^'^^^ 


cacaaaag c 


42601 


ttcatatcat 


cc caaa 


G taaacta 
a'^ct^ctcaa 




cagagcc 


ca aac ca 


42661 


ttgcatatac 


ctcttatact 




ct^at^ttt^ 




a ga g cac 


42721 


agtttttatc 


ctcttaaaat 


t 


K 


t ttt t 


ca gcac a 


42781 






a t 


tttttttt 


tt t 

a acaagg 


aacaag gc 


42841 


ctcaccctcc 


tcccttctct 


cccca 


acacagaagc 




cac a ac 


42901 


taaacacttc 


acttatactt 




aaagag gc 


aaatacatgc 


aaa aaagcc 


42961 


tttataactc 


ccctctctcc 






caagtatttc 




43021 


cactctaata 


aaqqtctact 


ctqaqcttca 


ttctcaacat 


acagtcctgg 


attatttac 


43081 


ctaccaatat 


tcacctctac 


atgaaccagt 


tt 




aaaacaattt 




gtgatttttt 


ttaatgtcac 


aatttttctt 


tttaaRqttt 


aaaaaaaaaa 


tt tttt a 
g g ga 


43201 




agtttctatt 




cacctatttc 


aqaqtcc 


aaag cacac 




atcaqaaqct 


aqataaaca^ 


acatata^tc 






aaaaagca 


43321 


qactqcttaa 






ata cttcat 
a age ca 


t'^ttt 
c ag c 




43381 


Wttttacact 


tcacacrtat 


taqqata ct 








43441 


caaLgguggc 


aaaaatat a 
ctctat'^ aa 


a aaatta^a 




actgctgqtg 




43501 






aaca'^tat^'^ 






aata aatt^ 
aa agaa a 


43561 


ccatatqatc 


caqcaattct 


acttct t 


atatattcaa 


aaaaattta^ 




43621 


caaaqaqttc 


tttqtacaqc 


catqttcatq 


qcaqcattac 


tcacaata c 


caaaa^^tca 


43681 


aaqcaaccca 


atcqtccatc 


cacaoatqaa 


taaat a 
g a aca 


aaa g ggca 


t 

acaa ^^^^ 


43741 


t t 


tctttaaaaa 


gaagaaaatt 




tt ^ 


ga 5^^/^ 


43801 


tttttaaatt 


^t^t ttaca 


ttttaatcgt 






gaca g gca 


43861 






taggtaaacg 


g gcca gg 




cacc a caa 


43921 


cccatcacct 


aaotatta 








c c cccacc 


43981 


cccaccatcc 


cccaacatac 


t t 


aatcttaagg 


ccc ^^99 


gcg gaaa a 


44041 


aaccaqtcac 




t t t 


attctactta 


t 


c agag ag 


44101 


ccaaqttcaq 


aaaqtaqtaa 


taatggttgc 


caaaaggtgg 


R t 
g gca gcag 






atattqatta 


ac t aa 
^aatata^tt 


tctgttttgc 


aagg aaaga 


ag ccagag 


a gctgca 


44221 


ccaacaatgt 












44281 


atgttttatg 


tqtttqccac 


aaccaaaaat 


aaa attaaa 


ttaaaaaata 


aaa^atttta 


44341 




cacttaccaa 


acccactqta 




tcRtgaatat 


^^^g^ attt 


44401 


tcaaaacacc 




tcccttattt 


qttcattcct 


tcatqtataa 


cccacaatqt 


44461 


gttaggttat 


qtqctqqqca 


qcctqqaaqa 


cccaaaqacq 


aatraaaaaa. 


gtaccctata 


44521 


tttaaagcct 


tqcaaqqaca 


taaatatcag 


taatataaaq 


taaaatatqt 


tcactqctqa 


44581 




acttcctaqa 


tttttcaatt 


caattqcaat 




ttqccatqtt 




gatcccacaa 


attcacttqt 




taac t ctt 


attatcata 


ca ct cttc 


44701 


tttcataaga 


ggaatacaga 


taaaaYgcta 


tgggatttca 


gggggagaaa 


gaacttctta 


44761 


ttaaattcat 


attttcaacc 


aataagaaaa 


cacaattttt 


tttaatcaac 


actatcttct 


44821 


tggtgctttt 


ttcacatctc 


atcctctttc 


ctcctttacc 


ctgcctaagc 


aaccatttct 


44881 


ggacccaatc 


ttgggttatt 


acatctgacg 


gcaagtgtta 


tgtgggccat 


ggaccatgag 


44941 


caattagccc 


cctctctact 


cctagggatg 


gagtacatgt 


gatgatttcc 


atcagactca 


45001 




cttgctggta 


agttacccct 


aaaaatcgat 


taatggactc 


tccaggtaga 


45061 


cactgaggta 




aaacctRact 


tatacttagg 


ggctttggaa 


gcccaccaga 


45121 


gtcatcaagc 


catatctcca 


Yctaggctct 


atactggaaa 


gctggtcagt 




45181 


agaacaagcc 


atttgggaaa 


acagcccaca 


tgaaaagaac 


tgtggtttca 


agaagagtca 


45241 


ttccatccca 


ggtttaaaat 


actcttctcc 


tctttatcta 


ccttgaggac 


tcctatctat 


45301 


ccttccatac 


cagatccagc 


ctcatcttct 


aagtgaagcc 


ctcctctggg 


acgtcagcSt 


45361 


ctcctctgca 


tcccctagca 


ttttatgcat 
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Yatccaccac 


aaac aga 




catggYtttg 








gagcc gaga 


cc ag ggg 


attca'^t'^ca 
a cag gca 


tgttgaaggg 


aaaaataaat 


acataaacaa 


45541 


acaagacaat 


ggaag eg 




Jz^^ 


aatacaaaat 


aaaaattata 


45601 


caastctcta 


a caagc g 


^^^^ 


gc aaa ga 


1-t-t 1- 




45661 


cactgaaatt 


tctttttgca 


agttaag c 


cgca aacc 


R 

aaaacaga a 




45721 


aatgaactaa 


acagacaaac 


ttauggctga 


caaaa c ca 




g gagagga 


45781 


aagtaaaatt 


caatgtgtcc 


ttgaaagttc 


gggg c aa 


ga a acagg 


ceaa ggaa 


45841 


agaatagaga 


cttcagaaa 


aaac a gc 


^ ^^^^ ^ 


ga aac 


gaaag gg 


45901 


tcaaagatca 


tttaatgaga 


aaaaaaggag 




^t'^ t^tata 




45961 


ggtatacata 


tgcaaaagaa 


aaaag ggg 


c c acc 


^taaaattct 


caaaaattaa 


46021 


ctcaaaatag 




aaaaaa aaa 


age aaaac 


c aaaa c 


tat a aaaa 
tctt'^aatat 


46081 


ctctaaaatg 


cttagggaat 


aac ca aa 


cac gga 


cacaagcc g 


c gaa a 


46141 


gacaccaaaa 


gcacaggcaa 


cataaataaa 


agagacaaa 




aaaa aaa 


46201 


cacttttgtg 


agtcaaacaa 


gctatcaaca 


aagtcaaaag 


aca cc a a 




46261 


aaatatctga 


aaataattta 


tctaataagg 


gattaatatc 


cagaaaaaaa 


aatatatata 


46321 


tatatgtata 


taacctacaa 


ctcaacaaca 






aaaaa ggg 


46381 


caaaggactt 


gaacagaatt 


ttttccaaag 


ataacactaa 


tggccaataa 


g aca ga 


46441 


ttatgctcaa 


catcactaat 


cattgggaaa 


atgcaactca 


aa ccacaa 


gaaa ac c 


46501 


gtagccatta 


gtatgactat 


taccaaaaaa 


cagaaaa 


acaag gc g 




46561 


ggaaaagttg 


caaatcttat 


gcactggtgg 


tgggagagta 


aaa gggaaa 


g eac a aa 


46621 


aaaacagtct 


agtgattcaa 


aaattaaaaa 


tagaattaac 


atatgattca 




46681 


atgaagatag 


caggaaattg 


aactgatatt 


tgcacatcca 


tgt teat age 


agca a 


46741 


acaataacca 


aaaggtggag 


gcaacccaag 


tgtccatcca 


eagatgagtg 




46801 


atgtgatata 


ca cat at cat 


gggatatctt 


cagccttaga 


aaaggagaga 


^t tt ^^"^^ 


46861 


tatgctataa 


cacagatgga 


cctttggaaa 


gttaagaga 


g c a 


t t ' ^ll'^ 


46921 


aagagaaata 


agccatccac 


aaaaggacaa 


atgctgtacg 




^ t^^ a^'^ 


46981 


ctagggcagt 


caaattcaca 


gaaacagaaa 


g aaaa ga 


gg ccagg 


aaattaaaaa 


47041 


agggtagaat 


ggaaagctat 


tgtttaatgg 


gttggagtgt 






47101 


agttctggag 


atggatgatg 


gtgagttttg 




tgaatgtatt 


taataccact 


47161 


caactgtaca 


ctaaaaaatg 


gttaggatga 




gtatatttta 


ctataacaaa 




aagtcataaa 


aatcaaacat 


ggttcttctc 


attaaaaaaa 


aaagaagatt 


cctgggtcta 


47281 


gttggttctc 


cctccttctt 


agttcttgga 


tcaaggagat 


agaattgtgt 


ccaaagtgac 




caagtgacct 


acggagtaat 


gggcttccat 


ctagccactc 


cctgctttat 


gaetgagctg 




cactgggaga 


ggcattgtca 


ggggaaattc 


caggtttcta 


gaattggtat 


ctagccagaa 


47461 


tttgtcccag 


ggctaagtgg 


ctaaggcagt 


gaactagtct 


gteagcttet 


ctgttttaaa 


47521 


tggagtcatg 


taattccatt 


acataccctt 


tgctttcaac 


tatatggeea 


aaeatgaetg 


47581 


gccagcaggc 


tgggggacag 


aagacatgtg 


ggcttgtgct 


tgttaYctgg 


aattgaaaag 


47641 


agctcttggc 


atggttcttg 


tt caagc tec 


atggttgact 


agcaeatgae 


caceaagaga 


47701 


gctactgtac 


ttctccgaga 


ctcattttct 


cttctgtaaa 


aaatgaetga 


gtttctttca 


47761 


cagtgcataa 


atcttttcct 


ccaggaacag 


agactgccca gggcctgtcc aagacaacta 




ggaaaccaga 


gcagcagtta 


agccattcct 


gggcaggaat 


aaatgagcag 


atgteagetc 


47881 


tcagaatctc 


agttttctca 


ctcataaaa 


ggaaaagaat 


aetatttctt 


ttatgtcatt 


47941 


caccatattg 


tgagctgaag 


gcactttaac 


gatgctgagg 


ctgtaaacta 


atgcataatg 


48001 


tgatgtttat 


cattcttcac 


ctatgttt a 


gttactgagc 




accagagagg 


48061 


acgcaacaaa 


aatcacaact 


cc agaac g 


cacagtctaa 


atggcaacca 


ctagcccctg 




tgacaactga 


tcacttgaaa 




ccaaattaag 


atgtgctgta 


ggtgcaaaat 


0 Q1 
^B9 1 


atatgctgga 


tttggaagat 




aaaagtatct 




ttttatattg 




attacatatt 


aaaaagattc 


ca ggc 


atgctggact 


cttcagatat 


atcctgggag 


^ a^«i 


attgtattaa 


aagaaaatac 


a ga ggaaa 


ttagtgtagg 


tagttactag 


aaaatgtaaa 






tcacatccta 


g cac g 


tctagacata 


taaatataac 


tttgagttat 




ttttaagcat 


gagaagacag 


f ^^^^''^^ 


ataaagtgaa 


aaagctaaag 


attaaagata 


^ft4«1 


cttacaaatc 






gtcattttga 


geatattcet 


taeeattgaa 


48541 


gcccctgcct 


gtctggctcc 


tctgcccacc 


ttgttttgct 


aatgggteat 


getcetatat 


48601 


gtgcatagac 


gaggagcggc 


tctcagggac 


tgaccgtaag 


catagtcage 


acageacgta 


48661 


tacccatgct 


ctgaccccag 


gttggcgtct 


tggagggtga 


agcattttcc 


cagtcettet 


48721 


ttgactcatc 


acagaggtat 


aaaacagcgt 


ctccaacacg 


tgattttttt 


tttttttttt 


48781 




gagtcttgct 


ctgtcgccca 


ggctggagta 


cagtggcaeg 


atcttggctc 


48841 


actgcaacct 


ccaactccca 


ggttcaatca 


attctcctac 


etcagcctcc 


caagtagctg 


48901 


aaattacagg 


cacacaccac 


cacgcctggc 


taatttttgt 


attttagtag 


atacggggtt 


48961 


tcacaatgtt 


ggccaggctg 


gtctcaaact 


cccgacctca 


agtgatccac 


ccatcttggc 


49021 


ctcccaaagt 


gctgggatta 


caggctgagc 


cactgtgcca 


gttaaaagaa 


aaaaaatatt 


49081 


caaacatatt 


tggtttctta 


ttgaggtgaa 


gttcacataa 


catgaaacta 


acagttttaa 


49141 


agtgaacaat 


tcagtagcat 


ttaattaaca 


atgttgtcca 


accaacacct 


ctgtetagtt 


49201 




ttgtcacccc 


aaaaggaaac 


cacgtgctca 


gaaggeagtc 


acteeaeate 
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49261 acccctcttc ctagtcctat gctttgcaca caactgactc accctgacag ttataaccta 

49321 aaagttaaat aaaattatta gtaaactgca aacccctggc aagtgaatta cttaatatct 

49381 cttaaaatcc tgacatcaaa tgattcccct gtRaggccat gagatgataa gaacatccac 

49441 ctccctctct tgctccagtg gaatgaaacc ccctcctcaa ggagggttac actcaggcat 

49501 aggcccgatc actcatcata tattgacaga tgtgcaggcc tgagaccccg ccttctggag 

49561 ctttctaatg ctccttgttt ctcaatccaa tccttgctag gaattaggct cttctgctga 

49621 atgtaaaatt tttccatctg cctccataaa gtatgtgcca cctgtcagct gactggtcaa 

49681 ccccaagctc aaagtatatg gacattttgg gggtgacaat ggtgggcata catctttagc 

49741 aaagattaag aaggtaccag aggaggacca gcacaaccca gcaatgcagc agaatYcatc 

4 9801 aggaaagctt tggctctggg tggagctcct tgctttagct ctgcctttga gcgcttgggt 

49861 ggctggggag gcactggggt ggaagggaag gaagcaggaa ttcgtgtgct gcaaatttca 

49921 ctacacttga ccacaacttg gagtggaaag ggcactgtgg tRcgtctatc tggccttcct 

4 9981 ttacaagcct taaaatgctg tggaatccat ctggatggaa gacgccatcc aattataaat 

50041 ccttaggata ctttttattg ccctttaaga gttcYgatgc ttgctcttcc atRcccatat 

50101 gtttctgctt aagaggcttc attcaaatca aattggaaga gaagtccaga ttacccaact 

50161 acaattacct gagcaaaaat cccctcaatt atcttggtaa tccagataaa ttcggagtac 

50221 aaaggtgagt tgcctggata gcactggcta agaactgctt tgtctcccat ttcaccttgc 

50281 attattataa tctacttgat tacacttcat ttataacaat agggagggct tgttttatgc 

50341 cacacattgc cacttgctca agtgtaaaat aaaccttaaa tgactgatgt actgctaaat 

50401 ttatctcaaa aaacaatcat aacaggaatc ttaaggcaat aaagtcaaac aaatagaatt 

50461 ttacagaatt taaaaattat tttgtgagtg atttatgcta aaatacctca aatttatttc 

50521 aaaaacagtc ttacttaaaa ctgacttaaa atatatgtag gataaataat Ytgtagtgaa 

50581 tttaacgaaa cttcagcaat tcaaatgaat gcacRttatt cttcttaaaa acagatatcc 

50641 tgaaaatatc catcaggata aatagctaat gcatgtgtgg cctaatatct aggtgatggg 

50701 ttgataggtg cagggaacca ccatggcaca tgtttacctg tgtaacaaac ctgcacatcc 

50761 ggcacatata tcctggaacR taaaattaaa ttaaattttt aaaaagcaga tatttggaaa 

50821 gtaccaatag cttcaaagta aaaatactct Ncactgtttt tgtttgtttg tttattttcg 

50881 tttttgtttt tcttttttga gatggagtct cgctctgttg cccaggctgg agtgcagtgg 

50941 cgtgttctca gcttaccaca acctctgcct cctgggttca agtgattctc ctgacttggc 

51001 cgtgccacca tgcccagcta atttttttac atttttggta gagatgatgg ggtttcactg 

51061 tgttggccag gctggtctcg aactcctgac ctcgtgatcc acccgcctca gcctcccaaa 

51121 gtgctgggat tacaggcgtg agtcaccgcg cctggcctat gcactgttaa ttttatcacg 

51181 acagttataa atccatcaag atgtcctgtg atttttcaac acaaattact caaattatga 

51241 acttctctgt gtgtattttc cactggcttg tttatgaatc ctgctgatat gtctgctaca 

51301 atttctccaa aggctaaaag atgtttacag gacaaatgac atcatgacca tgtcaggctt 

51361 aaatgtgttt atatcagttg ggtttagagg gtcttcaaaa catggcttta aaatctcagt 

51421 tattatggag cttaatgtgt tcctgatgtt tctggagtMa tgtatttcta aaaatagaga 

51481 aatagattgg ttaaaagaaa atgaattgtt ggtctaatct gtcttatgca tggctaacga 

51541 aatgcatttt cactctcttc cgtgataata cgacacttaa aactgtgagg accaatgggg 

51601 tgataagaga caaaataaac agtaaataga tactccatgt gagttactac tctgtaatag 

51661 atagaaaaga tattttatag aaaagggaaa atacccacag acatataagc aatgtcctgt 

51721 ggttgctatc aaatttactg ttagctgttt aaaactgcct taaaggtttt ttaaatgtgt 

51781 cttttctacc atagattctt tagacgacag ccatcaagca gcaaatacct cgcaggaggc 

51841 cattgtgttt ctatttgaac ttctatggat gtcaagctct gttcttcaaa atttagccct 

51901 acagatgggt ggcccaccgg agagaattaa tctcactatg cagccaacac taccctttct 

51961 ggatctttcc ccagtagtgt ggattctgct ctagtccaga attatgcaga ataagtatac 

52021 ttcatcttag acaccataat tcttcaaata tctaaaatgt atattatgct ctccaaatcc 

52081 tggtcatctc caggttaagt atctctcaaa aacagtccat cagataatat gattttaaat 

52141 ctttatcccc ttggtcatat acccacaatc tcattctatt taccaatatc cctactgatg 

52201 cataaactac atatgtatat gtgtgttttt ttttaaaaaa aaagcaattc cactgtaatt 

52261 ttagcagagg ttatttccac atagagctat gtatatgtgt atatacacat atctttatat 

52321 tatatagttt tctgtgcttt cccagttatc tacaatgggc atatatgact tttatagcca 

52381 ggtaattcaa tattcattca ttcagtgaac atcattgagc atccactatg ggctgggaag 

52441 tgaggatgga gccacaaaca agcaaaataa aaacccctgc tcttgtggag ttcccttctc 

52501 attggaagag acgtacagta aagaaaaaaa taaaagagta aagttaatag cctgtgagaa 

52561 ggggatataa aagtgggaag gaagacaggg tgatgtgtga gaaacacttt tttttttttt 

52621 ttttcagatg aagtctcact ctgttgccca ggctggaggg cagtggctcg atctcagctc 

52681 actgcaacct ccacctcctg gtttcaagca gttctcctgc ctcagcctcc caagtagctg 

52741 ggacttcagg catgcaccac catgcctagc taatttttat atttttagta gagacaggat 

52801 ttcaccatgt tggccaggct ggtcgtctca aacttctgac ctcaagtgat ctgcctgcct 

52861 tgacctccga aagtgctggg actacagcaa tgagctgcca agcccggcct ggggaacact 

52921 attgaatagt gtggtcaggg aaggctttac tgtgaaggtg atgaagaccc aaagaaagtg 

52981 aggaactaag tcataaatga tgggtagaac attccggatt gcgggagcag caagtgcaaa 

53041 ggccctgggg tgaggaatac caagaatgtt tgtgtgggca gagcagaaag aaccaagggt 
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^^1 fil 


aaagcattag 


gagatgaggc 


gaagaag a 


accggga ca 


ggtggttagg 


gccc caggc 




cctggaggaa 


ggctggcagc 


gc ggagcca 


ca ggac ga 


caatgtctag 


cag ca ccc 




ggctccctgt 


gaagaacaga 


ccc cgc g 


cagagaagcg 


aggacaa a 


ggaaagcaga 


53281 


gtcatgatcc 


acgcacagga 


tcacggtgac 


ttggaccaga 


aagagagcag 


tggagctggg 


53341 


gagtgactct 


tttcagtatt 


tctcaggtgg 


taccagtggg 


atctgcagag 


aatacatagt 


53401 


taaaggaaat 


gtctggaact 


aaacatgact 


cttcaaacgt 


gatcctactg 


tggcagaaga 


53461 


gaggcgactt 


tacatttcct 


ttcccaggat 


atcttattct 


aatgaacaca 


gaaaccgtaa 




aatcaagcca 




tgaaaatgat 


tatataaggt 


acacagccca 


ggctggattt 




gaactgatga 


gggaaaaaaa 


atctc agtg 


aaaa cc c 


tctctaacca 


cagccttctc 


53641 


tccttccatc 


tctgcctcac 


ccttgctctt 


tctgaagcct 


tcctggatgc 


tcatgaagct 


53701 


cYctccttct 


ttgactcatc 


ctattttccc 


aggacMttta 


tcaactcagg 


gctcttctcc 


53761 


cctccccaca 


gtcagctcac 


tigtgctctga 


ctagYtccct 


gaaattgctc 


atcctcttac 


53821 


taaccaggaa 


gcttctaccc 


gctcttggtc 


ttctttctcc 


accccagtgc 


caaagaacta 


53881 


agcactgatg 


gagaaaaagc 


atgcggctaa 


tactgtctgt 


ctctaccaca 


aatctcataa 


53941 


tttttaaaat 


ccatttggaa 


tttacattca 


ctccttaact 


tcttttcatt 


cactcctaat 


54001 


tcacttccta 


tataattctc 


tacacagctg 


ctgaaaacct 


tcctttctgc 


tgcctcttac 


54061 


cccaaaacta 


ctctcacatt 


actcacaaag 


gttgaccttg 


ttttctatWt 


acttttttag 


54121 


agtaaYgaag 


ccataaaatg 


taagttcttt 


caacctggac 


ctcccactct 


tgggatttct 


54181 


caatagcttc 


acctgtattc 


tcccccttca 


ctcttgactc 


agaggtaaaM 


gctcccttgc 


54241 


tcccaaatcc 


tttagcaacg 


taggglitgat: 


aattgctcct 


ttccaaattt 


aatccctgca 


54301 


tctgcgcccc 


tgagacagac 


actgccatga 


tttcccagct 


tgctgtcttc 


tccctcctgc 


54361 


atctcctccc 


tttctctgtg 


acggaliattt 


tccctgtgca 


agagaaattt 


cttctgccct 


54421 


aaaataaata 


atccaaaaca 


tctcttgact 


ccacttaagc 


taacattcaa 


tatctcactt 


54481 


tcccttgtat 


taacaaaagt 


taattgagtc 


agtacggtgc 


tacaattctc 


agacacaacg 


54541 


ataaacatga 


tagagaccat 


caacatgcct 


tggcattttg 


cctgctttct 


c't'tcaatL'tac 


54601 


ttgcaacttg 


gcttctactg 


agctccactg 


aaaaagcttt 


catigacatga 


ccaca'tccaa 


54661 


tcccaaagaa 


acLgLcccct 


tgcccagtgg 


tcttcaaaac 


cctccaactc 


cacagtgtct 


54721 


cattaactcc 


aactcatttc 


ctactaaaca 


tctcttctca 


gctcttcaaa 


gtcaaagatt 


54781 


gcgaaccaaa 


tctacagact 


tctcctctct 


tctcctttgg 


gtccaactcc 




54841 


atgaaacttc 


ttattcctag 


aaatggtgtc 


tgcacctttt 


aggcttccag 


gctYgagcct 


54901 


agagtcatct 


ttaactcttt 


ccatacccag 


actcccaaga 


cccagtcagc 


tgtttaggtg 


54961 


tctcctctgt 


atgcagcact 


tactgtcatt 


gccctggtcc 


atctcccaga 


gcccacatct 


55021 


tgtcccactg 


tctgcaaagc 


ctttcctata 


tgctggatat 


aagtgcaaac 


tccccctaaa 


55081 


atcccattga 


agtttttact 


tttttctatc 


ttcttttttt 


acttcttttt 


atcttcctct 




actcacacag 


aagtgctcca 


att.atgagga 


ccttatttta 


ttccctttac 


tgticaaacac 




aaagccaaca 


tgccatacga 


aaaga'tagta 


ctgtgcttcc 


t'tcag'tgcca 


ctctagcacg 




cccactccag 


cacacccttc 


ttgtgcgggt 


gtatgctcat 


gtgatactat: 


cacaggcaac 




acttactatg 


caatacgttt 


tccacaatca 


caatatttcc 


acaatattgt 


gatiatgagct 




cattcttcag 


gccttgacac 


aagcgcccac 


aaacctaaga 


gaatgcattt 


tcatttttat 




gatacacagt 


tttcRactgc 


tccactctgg 


ctgccaaggc 


attcctcctc 


caaatcacRa 




tgtctatatc 


aaaagaagaa 


caatgaggga 


gagctggagg 


tagcgctaag 


aaacaacctg 




gtccagctcc 


caggtctcat 


agagaacact 


cagaacacgc 


ttgttagagg 


tatctgctcc 




tggcctcagc 


ttctcagaag 


caattacaca 


tcttctgggt 


aaacgtggag 


tgtgcagcac 


55681 


agaataccaa 


agctgagctt 


cccattgtac 


ctggttgttg 


atctcctcgt 


tgacctccct 




cagctgaata 


tttcttaaga 


acagtgttca 


attaatccaa 


ctatccctgg 


tggctggcag 




gatgaatcag 


ttatgactta 


tttcatgttt 


gtgggtgggt 


gtatgaagtg 


g cc caaag 




gagatttttg 


ctctctgcag 


acagtctgcc 


tgtctgtagg 




oca cc 




tgagtgggaa 


tatgagccag 


ggtgttggaa 


gaaggaagag 


tccaaaagat 


g c caggga 




atgtcccata 


gccattggcc 




tagataactt 


c gagccagg 


caccc aaa 




g gc aca g 


aaaagticaug 


gatttttttt 


ctaacttgtt 


'^^^^^ 






ggca gaa a 


tctagcttcc 


catgttgcag 


ctatggattt 


^ ^ gcca 


aaa a 999 






gagagggaag 


aaggcacgaa 


gaugca't'tcc 






56221 


catggcctca 








acaaacccca 


ccatcaactc 


56281 


accaaaatiga 


agagtgtctt 


agacttattc 


tcacaagaca 


agagggaacg 


ctgcctctaa 


56341 


ggagtctctc 


tgggacacct 


gggagagatt 


tatttaatgg 


taaagcaagg 


aaaggaaaag 


56401 


atgaacagtg 


acctgcaaat 


cattaaagag 


ggcgactcga 


tcataggcaa 


gggcttttat 


56461 


tagtgctaaa 


attccacttc 


aaagtttgct 


gattcatgga 


tgctttttgt 


tttgctgttt 


56521 


aYgttgtaga 


aactgatttg 


attttcatag 


acaaaataac 


taggcattta 


taaccacata 


56581 


taggtatgaa 


agaaagataa 


aatgatgccc 


ttagcccagt 


cagaaatact 




56641 


aaaatagttc 


cctgaatcta 


aYaggccaac 


aatattccaa 


caccagaacc 


tttgctacca 


56701 


tagccagaat 


ttttaagact 


gcaccactgt 


taagtagagg 


caatttttaa 


tacttagggt 


56761 


ctaaccacaa 


ggcataaaat 


gcaaaaaaaa 


ttacataatc 


ctgtactttt 


taataattaa 


56821 


atgaaaatac 


atagactcta 


agcaggaaaa 


tatctgagat 


ccttaaaacg 


gtataatcat 


56881 


aattgttcta 


cattggtaat 


atgtgtgatg 


atatttttct 


ccaaaaaaaa 


ttctctacca 
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56941 tttcattctc tttRatttct tctagcccgt aaggtctagc aaagaggaat acctgaaaag 

57001 atgcaaattg tacccctgtg ctgttcaatg acagcaaaaa ttgcagctcc ccaaccccac 

57061 ctcttatgtc ccctgagcaa tctgtgatca cttgagggtg aggacagtat attgccccaa 

57121 taaactggct ttccaccaag taccaccaag aaatcaaaac agcttttatt ttattgtgtc 

57181 atcatatata agttaatagt tcataattag ttcaaaattt aaaattatgt tctaaaaata 

57241 aagctttcta agtcaatgta ggcaacRatt ccaagatcct tatagttttc ctctcttaaa 

57301 cagagaccat tatcatatga tttttttttt tttttgagac ggaagctcac tctgttgccc 

57 361 aggctggagt gcagtggtgc gatctctgct cactgcaagc tctgcctcct gggttcacgc 

57 421 cattctcctg cctcagcctc ccaagtagct gggactagag gcacccgcca ccacgcccgg 

57 481 ctaatttttt tttttttttt tttttttgca tttttagtaa agaccacgtt tgaccgtgtt 

57541 agccaggatg gtctcgatct cctgacctca tgatctgcca gccttaacct cccaaagtgc 

57 601 tgagattaca ggcgtgagcc accgtgcctg gccagtcata tgatttctaa agagaaactg 

57 661 aacagttatt tgtagaaact ctgcaaagca tgactttggg acgggcacag tggctcacgc 

57721 ctgtaatccc agcactttgg gaggctgagg cgggcagatc acgaggtcgg gagatcgaga 

57781 ccatcctggc taacatggtg aaaccccgcc tctactaaaa aaatacaaaa aaattagcca 

57841 ggcatggtgg tgggcgcctg tagtcccagc tacttgggag gctgaggcag gagaatggcg 

57 901 tgaacccagg aggcggagct tgcagtgagc caagatcgca ccaccgcact ccagcctggg 

57 961 caacagagca agactctgtc tcaaaaaaca aaacaaaaca acaaaacaaa acaaaacaaa 

58021 aaagccatgc ccaagcccac ccctgaccag tgaaatctcc gggattgggg cacaggtata 

58081 atgtttttat aaacatcttc ctagatgttt cctgcatcca gggtgataaa tcatcatact 

58141 aaaggcatca tctgaaggct cctagagtat agagaagttc agaataagga ccccaaacta 

58201 ttgcacatcc tgacactaac tagctgtagc tgtgcaccct ggacagggag agatctaaac 

582 61 agctagttat cttacttcac cRaaatctac aactaaagat gtctatccac gtgttatagc 

58321 tttggcttcc ctaatctaga tgttcaatct acttttctgc aatttggcaa agtaaaacac 

58381 acatatacag taatgcacaa tgtcctgatt gttggggttt attatttgtt ttgcatttac 

58441 gcttcttagg tatgtaacat gccaaagtca tgggagctga gtcttttgaa acagaggctt 

58501 ctttgggtgc aaatattaat tttctatgaa aaaaagagaa agtacaaaga aaagaaaaaa 

58561 cagcagtaat acagaaatga atatcttcct tgtctaggac acagtgtttt ttttaaccta 

58621 gatttctttg aacacagact gacacaaaaa Ltaaatacta ttgctccatt ggagaggtgc 

58681 aaacctgggg cggtgagagt gaggggaagc gagtgagatt gtcactacac tagcttctgg 

58741 ggacatggag acagccagLc actgagcccc tgccttccct tggccatgca gatgcctctg 

58801 ataggttaca cggaaaaatt ctacctcaca gcagcccatg gggtggagga aaaataggaa 

58861 cttagttgcc tacattcttc ctgcatcccc ttcctcattg gtcagagtca ccctgMatgt 

58921 ctgggttgtc acctgctccc aaggtgacca ctccaaaagt caggttctac gcccggtggt 

58981 ggggtgcttc atccaagtac agaaatgcca agaggatccg gaaacctttt gacctggttg 

59041 ctgagtcaca ggaaagaagg gccagccctc ctgggacagg taacaggctg gccctgagct 

59101 gtggaagccc tcacacccag aaggagtcaa tctgctgggg tggccactga gaccaagcag 

59161 gagctgaggg tcctggagta atagatggag gcccatgaga tgtgttcaga cacatagtgt 

59221 gtttgtcaac tgcaatgtgt gtgtgccccc aaaattcatg ggtttaaatc cttaccccca 

59281 aggtagcagt atgaggagga ggggcctttg gaaggtgact aggtcataag gatggaggtc 

59341 tcatgaattg cattagtacc tttacaaaag ggaccccagg gagattccct cacccttttg 

59401 tcacagtgtg aagacccagc aagaagatgg acatctatga atcaggaaga gagccctcac 

59461 cagaacccaa ccctgctaca ctctgatctc agacttccag cctttagaac tgtgagaaat 

59521 aagtttctgt tgtttatatg ccaStcaatg tatagcactt tgttacagca gcccaagcta 

59581 agacagccta ttatgtatga gcaggtggct ttgagataat tgagaaaaca gagactataa 

59641 ctacaaccag ccagagatgg gatcaYtgca aatgggactc gggttcacac aaggtttgtg 

59701 acacacacac aaaaaaacat aaggcagata aatcagcaag ggtagcagtc aggaattgac 

597 61 agacgattaa tggattgagg aaacagaata gataatttaa aagcaaacct ttgtacacaa 

59821 aagaattcag gaggtgataa aagtaacatt ttcatatcag ttggaaagta agttattcaa 

59881 tagcaacaga acaatcattt attgaggaaa aaWaactatt tctttcctgg ttcaatcttg 

59941 gggtgttgta tgtttccata aatttatccc tttttttctg ggttttctaa tttttgtgca 

60001 cagaaatgtt ttttaatagt ctctgaggtt ttttttttat ttctgtgggt tcagtggtaa 

60061 tgtccccttt gtcatttctg attatgctta tttggatctt ctctcttttt tctttatcag 

60121 tctagctagt tgtctatcaa tcttatttat tctttcaaaa aaccaacttt tatttttgtt 

60181 ggtcttttgc atggtttttt tgtatctcaa tttcattcag ttcagccccg attttggtta 

60241 tttcttttct tctgctagca gtaagtttgg tttgctcttg ttttttctag ttcctccagg 

60301 tgtgacatta ggctgttaat ttaagatctt tccaactttt tgatatcagt gtttagtgct 

60361 ataaactttc ctgttaacat tgctttaact gtatcccaga gattctggta tggtgtatct 

60421 ttgttttcat ttgtttcaaa taatttattg atttctgtct tagtttcatt gtttacccaa 

60481 aagtcattca ggagcaggtt gttcaatttc catgattttg agagatattc ttagcattga 

60541 ttttattttt accgtgctgt ggtctgaaag tacggctagt atgatttcag ggtttttttt 

60601 catttgttgc aaattgtttt atagccaagt gcgtagtcaa ttttgaagta tgtgccacgt 

60661 gtaggtgaaa agaatgcata ttgtgtcgtt gttgggtgca gtgttctgga gatgtctgtt 

60721 aggtccattt ggtcaagtgt caagtttagg tctcaaatat gttaagttag ttttctgcct 
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60781 tgatgatcta tctaatactg tcagtgagat gttgaaatct cccactatta ttgtatggtt 

60841 atctaagtct ctcactaggt ccctaaaact tgatttatga atctgagtgc tccagcattg 

60901 ggtgcataca tatctagaac agttaagtct tcttgttgaa ttggaccctt tatcattata 

60961 taatgccctt ctttgtcttt tttgatcatt gttggtttaa agtctgaaat tgaaacgctg 

61021 aacagaccaa taacgaatta caaaattaaa tcagtaatat aaagactacc aaccagaaaa 

61081 agccctagac cagagagatt cacaaccaaa ttctaccaga tgtataacaa agagctacta 

61141 ccaatcctac tgaaactatt acaaaagatt taggaggaaa gactcctcct taacctcatt 

61201 ttatgaggcc agtatcattt tgatacagaa acctggcaga gacacaataa gaaaagaaac 

61261 ttcaggccaa tattcatgat gaacagggat ttggaaatcc tcaacaaaat accagtaaac 

61321 caaatccagt accacatcaa aaagctaatc caccatgatt aactacacat tattcctggg 

61381 atgcaaggtt ggctcaacat atgcagatca ataaatgtga ttcatcccat aaacagaact 

61441 aaacacaaaa accacatgat catctcaata gctgcagaaa agacttccaa taaaattcaa 

61501 catcctttca tgtttaaaac actgacaaac taggcattga aagaacatac ctcaaaatac 

61561 taacagccat ctatgacaaa cccacagcaa cactgtactg aacaggaaaa gctggaagca 

61621 ttccccatga gaattgaaac aagctaagga tgcccactct caccactctt agtcaatata 

61681 gtactggaag tcctagccag agcaatcaag caagagaaaa aaaaaaaaaa gaaacaaaat 

61741 acatccaaat gggaagagag gaagtcaaac tatctctctt cacagaggac gtgatttatg 

61801 tctggaaaac ctcattgtat ctgtctaaaa actcctggat ctgaaaaaca atttcacaac 

61861 gttttaggat acaaaatcaa tgtacaaaaa tcagtagcat ttctaaacat caacaatgtc 

61921 caagctgaga gccatatcaa gaatgcaatt ccattcaaaa tagcaacaaa aaaaatacaa 

61981 tacttaaggg tacagataac tggggagtga aaatctctat gacaagtatt acaaaacaaa 

62041 gatcaaagaa atcagagatg acacaaagaa atggaaaaac attccatgcc catggatagg 

62101 aagaatcaat attgttaaaa tggccatact tcccaaagca acttatagat tcaatgctat 

62161 ttctatcata ttactaatta cattttcaca gaattagaaa aaactatttg gaaattcata 

62221 tggaaccaca gaagagccta aatagccaaa gcaattctag gaaaccaaaa agctgaagta 

62281 tcacactacc tagcttcaaa ctatgccata aggccacagt aaccaagaca gcatagaact 

62341 agtacaaaaa cagacacata aaccaatgga ctagattaga gaacccagaa gtaatgtcac 

62401 acatctatag tcatctaatc ttcaacaaag tagacaaaac aagcaatgtg gaaaggattc 

624 61 cctattcaat aaatagtgct tggataacta gctagccata tgcagaagat tgaaactaga 

62521 ctctttcatt ttaccatata caaaaatcaa ctcaagatgg attaaaqact taaatgtaaa 

62581 tcatacaact ataaaaacct agaagaaaac ctagaacata ccattctgga cataggccct 

62641 ggcaaagatt tcatgacaga ctccaaaagc aatggcaaca aaaacagaaa ttqacaaatg 

62701 ggacctaatt aaactaaaga gcttctgcac agcaaaagaa actatcaata gagtatacag 

62761 acaatgtaca gaataggaga aaatatttgc aaattatgca tccaacaaag gtctaatatg 

62821 cagaatctat aaaaaaaaac ttaagcaaat taacaagcaa aaaacaactt ctttaaaaaa 

62881 aatgggccaa aaaaacatga aaagacactt ctcaaaagaa gacatgtgtg accaacaagc 

62 941 atatgaaaaa acgctcaaca tcactaatca ctagagaaaa gcaaaccaaa accaccatga 
63001 gatatcatga cacaccagtc agaacggcaa ttttaaaagt caaaaaataa cacatgttgg 
63061 tgaggctgca gagaaaaagg aacagttata tgctgcgtgt gtgaatgtaa attagttcag 
63121 acactgtgga aagcagtttg gagatttccc aaagaactta aaatagaact accattcaac 
63181 ccaataatcc catcactggg tatataaagg aatataaatt gttctactgt aaagacacct 
63241 gcattattca cagcattatt cacaatagca aagacataga accaatctag aagtccatca 
63301 gtggtggact ggataaagaa aaaaatgtgg tacacaaaca ttacagaata ctacacagcc 
63361 ataaaataga atgaaatcat gtcctttgca gtaacatgaa tggagctgga ggccattatc 
63421 ctaagggagt taacacagga gcggaaaacc aaataccaca tgttttcact tataagtggg 
63481 agctaaacac tgagcacaca tggacacaaa gaatggaatg atagacacca ggggctgctt 
63541 gagggtggag ggtagacgaa gggtgaagat tgaaaaacta ccaattgggc actatgctta 
63601 ttacatggtt tatgaaataa tctgcacacc aaactctcat ggcatgcaat ttacccacgt 
63661 aacaaacgta cacgtgtacc cccgaaccta aaataaaagt tagaaaggaa aaaaaaaaaa 
63721 ctaaatccct atctcacatt atacatctaa aaaatccaaa tagattagat ttaattaaaa 
63781 attaaaccat aaaacacaga aagaaaatac agatgaatag ttatataatt taggatgaga 
63841 acagcctctg tcaggatgaa ctaaagaatt cacaaaaaaa aatgttaaca aattaaatga 
63901 catgttaaaa atctggccag gggtagtggt tcacacctgt aatcccagca ctttgggatg 

63 9 61 ctgaggcggg cggattacct gaggtcagga gtttgagacc agcctggcca acatagtgaa 
64021 acctcgtctc tactgaaagt acaaaaatta tccaggcatg gtggcacacg cctgtaatcc 
64081 cagctactca ggaggctgag gcaggagaat tgcttgagcc cgggagacgg aggctgcagt 
64141 gagctgagat catgccactg cactccagcc tggctgacag agggagactc tgtctcaaaa 
64201 aaaaaaaatt aagaaactat atattgaaag actataaaaa atagttttgg tttttttctc 
64261 agtaaaaaaa taatatatcc attgtcaaaa attttctatt ttcaatatta ttatttaata 
64321 gaaaaatttc aacaatttct ggagaaaata tatgaaacac atttgacaga gttttaacat 
64381 ccttaacata aaggatgtaa cttatatgtt acatccttaa cataattttc tttgttcagt 
644 41 taacaagttt ttattgagta tctatattga aatatgcact atgtctgaca ctggcaacac 
64501 atttgaggaa ataaagaaga aaaattcaag tccacatgga aaaatggact aaggacatga 
64561 aaaagtataa atcatcagta cacttctaat aaagttaata atcaaagaaa tgcaaataaa 
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64 621 tgtgacctat caaggtcttg ctatttgtac tgtcaaatta taaaaataca ataacaaata 

64 681 ttggcagata ttggggaaac aagtaatgtc atatactggt atgtgcttaa actggtaaaa 

64741 gtcttcagaa aggaaattgg aaatatatgt aaaaattttg gacagttgca tattctttgt 

64 8 01 tctagaaatt ctacttatat gattagaacc taaggaagtt atcatagatt taatcaaaga 

64 8 61 tagttacagg atattcaaca tagccctgta tataacagcc aagaatctga agccaacaaa 

64 921 aaatatttag cttaaaatgt atagaatata cataggatga aattgcatag gtcactaaac 

64 981 agtttctagg gacatggaaa aaaaaatgcc atattgcaga aatgcaatat tttgggatga 

65041 taaaagggca gtttcaaaaa attgaaaaaa tagcagttac tatgacctct tttggcaaaa 

65101 ataattatat gtaatattgt tcatggaaaa atagactgga aatataacga ttaaaatgtt 

65161 aattgtatct gggttggtga gattacatat aattattata tataatcttt acttttttag 

65221 attgttgttc acagtgacat caagatttgt catgggagaa aaaatgaaga agattattat 

65281 tttgtagtca cccaaggaga gaagaagtca catctctgga gataggagga gatagggcat 

65341 atgagccaat cacatttctg cagccttttg tggatacaaa aggggtgggc caagcctttc 

65401 caatatccct tttcctcctt ggtacccaga cagactatat ttaccagcct cctttgcagc 

654 61 caggtgtggt catgtgactc attcctggtc aatgaaaggt agatggaagc ctggcctagg 

65521 aaacaggtct acatgcaatc ttccattctt ctcccaccca ctggctgaaa agaaaggctt 

65581 ctgaggacac agagtggggg gaaggctagg tagaggctgg atgggaagaa agcagggttc 

65641 tcttgctcct ctcatcagag agctactgag gagacccacc gcacccagac ccccatactg 

657 01 gactgttaca ggagtgagaa ataaactttg tttagaccac tgaaatttga gagttagagc 

657 61 tattagccta ccctgattaa cgctggggta gcccctcaca tctgcccagc accctcagca 

65821 gagcctaacc caatcatcca aggttaccca atagagaact caggcttcca ctagagtagc 

65881 cgtcacatag agttactggg tcttccaggt tctgccacct ccaccgtgtt ttgtgggact 

65941 ctccactgaa ctcacaaagc acatgtattc agctactcca aagctatgtg tactcaggaa 

66001 gagaaagttc taaatgtttc tgtgaccttg tgccactgat atcatgggtt ctatccttca 

66061 gagtgctaag aaaagtgtta aatctccaga atagctttgg aaaagtggag aaatttgaga 

66121 ctttctctat ttttagttgc taatatgcat tcctgaaaag tttatttaaa gccacaagga 

66181 aatatttcca tattgaattt caatattgca atatactatt ttagaataaa atggaaagat 

66241 ccaggacata gagcagctga gaagctacac agttggcccc tgggcttcac tcagacagtg 

66301 cttctctagg ctgcaagtca taagtctttg tgtcacccac ttttagaaga catcccccgg 

66361 acttctggga ccaacctgag gtttgtagga atctctgagg atcaatgatc aaacatatat 

66421 tagcttgaaa aaaaactaat atatgtgtat gtttttttaa aaggttaagt ttttaacaat 

664 81 ttagacgtct tggcattata ctctgagaac gttaaccaat gagaataaga gtgacaattt 

66541 agatgtaaaa cctcttaaag gtaaagataa aatcctaacc acatggggaa aaccatgtgt 

66601 tctgtggatg atttccctgg atttctcaaa agactcttag aatgttctgg gtatgacttt 

66661 gggggcatct gttctatgaa gatgggcacc ccagcatcac ctgcatatta acccttcttc 

66721 ttgctcctaa agatgcagaa ttagctgaga ttaggaagat gtacaaagtg ttctccctgc 

66781 acttcaacta atgaacaaaa ctttttaaaa attacttatg atttggaact attttatcat 

66841 tttatRtatc ctatggcaac tggcatttta tatttaattt tatcatcttt tgcttttttt 

66901 ctttgatcta aatcaggatg ctaagttttg ggggggggtc ctacagtttt ggcagagacc 

66961 tcattctaaa aaagaggtac cagactatta aggcagaatc tgctgagggt ggccccaaac 

67021 ctggtctggt gaacctcact caactcatca ttgatttcaa gggcacagaa gaggcaagga 

67081 gatgacagca ggagaaatgg acatgtacca tggagaaagg ggaccaacca caccttctgt 

67141 ccaaactgag atttcccagc ctgaaacagg tcaacactgg ggttagcaag aaactttaaa 

67201 ctgaatagga gattaaagta ataagaagat gagtactatt attgKtacaa attatattgg 

67261 taatagtttt aagtttcatt ctttaagaca gggtgaatca cagtcccttc tctcaaggtc 

67321 cttgtcagca ttctactttt ccaaggacat tcaaaaattt ctcatccaaa gtcaaacatt 

67381 agtgtgtgtg cctcaaacaa gaaatagttt tccatcagtc tccaaatctt tgaatagtaa 

67441 atggcaaaag tatggaaatg caagatggag tggggcttaa tgttgccaca acattcctac 

67501 aatactggga tagtcttcct tgggaattac tttctttcct atcacgtatc aataaggaaa 

67 5 61 gccaaatgta gaaataaaga aatctccccg atttacttaa cacacggagg tgacagttac 

67 621 agcgcaaatt gagggaagag taaatgattt cMacaaatga accaaaaaca cttaaaagcc 

67681 acatttaaga gaaatggagt cataaaggca atatttgtga gcagataaaa aatcagcaca 

67741 gaaaaaggtg agaaatacac aatgtctttc atcaaggtgc aaggaaatgt aaaataacat 

67801 taaaagtacc aactacataa tttgaatata aagagaacaa tagcaagaaa ggagagggga 

678 61 agtgatggcg atatggagac acagtctctt ttgcatgcac agggcagaac cggtgcctca 

67921 ggctggaggg tgtctgtgag taccccacaa tgaaRgcaaa agagcMccca gcattgaagc 

67 981 cacagacctt gctccatgaa tcctagctga tgggctggca cagagagatc aggagcagaa 

68041 caaggaaatc cataggagga tcctcccaaa gtactttttt ttctggtcca aaaatcctta 

68101 aatctgtcag aagcaggagg ttggagtggg agaattgcag agcctggaac acacctgcct 

68161 gctgatgacc cattctgagg gggttctttc cacttggcag caatggactt ggtaaaaaaa 

68221 ataatagatc cttctccatg gatcccattt cccttttttt tttttttttt ttttttgaga 

68281 tggagcccca ggcagaatta aagaccaact ctgggaccca aatttttaga ggacaaaggt 

68341 tgatataatg gccagcacga tttagtcatt tagatttcag gtctaactaa ctgatgacag 

68401 actttaacaa aatcactgaR agaaatttaa cgtcaacctt aaataacagt agaggcaggt 
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684 61 aatataacaa atagagtcaa aatggtttat ctaaaggtct aagcccaaga aacatccaac 

68521 aataaaatta atctaaaaca aattttttaa taatcatttt tttttcttga gacagagttt 

68581 cgctcttgtt gcccaggctg gagtgccatg gcacggtctt ggctcactgc accccctgcc 

68641 tcccgggttc aagtgatttc ctgcctcagt ctcccaagta gctgggatta caggcatgcg 

68701 ccaccacgcc cagctaattt ttttgtattt ttagtagaga tggggtttca ccatgttggt 

687 61 caggctggcc tcgaactcct gacctcaggt gatccaccca ccttggtctc ccaaagtgct 

68821 gggattatag gcgtgagaca ccacacccgg cctcatcttt tctttaacat tacaatttcc 

68881 catcatggtg gcacttgaca gtagtgggac tgtccaacag caaatgaccc catctaagca 

68941 ctcaggacag aagccatttt aggtcatgaa aattatacag attttgtgag tgaagggcat 

69001 ttagttctga gcagaaagta gtcttgcaaa agggaaattt gcaacaacaa caaaaaagga 

69061 caaggagggt gacaatgtca agcaaatgat aaccctgctg gagcttcaga aggaaccagc 

69121 tacatccctt tcctgcttct acagatgcta tggcagcccc agaggtgtcc cactaggatc 

69181 tcccttcaag aaagaatcta gcatgcatct gaacaacctc cagctgttca cacctcaggc 

69241 cctgcctcat cccagacagc cctgagccaa tcactgacac agcagggtac tagagcctgg 

69301 tcatttctgc ccaacccggg gcccctccaa aggcaagctc tgctccagac ctccccgttg 

69361 ggttggtcaa actatcaaat ctgtacctgg tctgaggctc tccctgccca gtcttgcttc 

69421 atcctcattt tatccttcac aggcacagcc cccttccccc tccaataaaa cactaacatt 

69481 cctaattccg tctcaacatc tgcttcccag ggaacctaac taacacaata ctcctaaaac 

69541 caataccagc aaaaagtgac tttccaagcc cctacatgaa gactcatttt ctggaaaata 

69601 aaatttaata taattccaat tacccatgac gctatttagc catttcaaag gaaatgagaa 

69661 agcttcccaa attgtggtcc tcatggagtg aactgatcat caaatgttgc tgtttacatg 

69721 agggcacaca ataattaggg ttgtatgtaa catgctactt tcagattttc aggtaagtct 

69781 aaatgctttt aattgatgca agaataataa gtccttcagt tgatttatgt catgcattaa 

69841 tatcaatcaa aaggttttct aacacatact ttttaatact ataaaatgtg gggttttttt 

69901 taatgtcttt tttttcttat tcagagccac atttcttcac atggtatgtt ttacctaacc 

69961 taagggttat gttgttcgtt tgtttttaat ttcaggtttg aggaatcaag taacaggtct 

70021 ttctaaatcc ttttattgga tatatgttag ttttccatta tcataaaaga ctgaggaaca 

70081 ggaacagcag catattgttt aaatgattca caggtaaaaa tatcttctcc ctcttggtct 

70141 gttttctttc ctgatgtctt tqctcattqc caacattgtc tattttatat ataattcttg 

70201 caggacattg tgccctgaag gtgaaRttgg acaggtaagg agatgaccca acaccccttg 

70261 tcacacctqc aggctctagg tcacttttgt aacacacaat gaaatgggta aattctacac 

70321 aatccttttt tttttttttt caaacaataa cagaaagtaa atatttgaac tgggggcttt 

70381 ttcaaacaac acaagtcaga aaaaattgac ttgtttttga ctgttacttt aggcatatat 

70441 ttttgattaa tgacaattaa atgtaaatat aaagatattt taaacttgag gttttaaaac 

70501 aaagaatgca agaatcacct ggttaaagtt agatattttg tcagtctcaa agtcagttgg 

70561 ctggataata ccatatcata aaattttata agactaattt gaaggtccat atgaattggt 

70621 attattgttc gtttttcaat attgatctta gcagtgtatt tacagcatac agcagagtta 

70681 atgtttaata caaacaaaca agactttcca gaaaatctac aaagaatcac taactaatag 

70741 ccatgcttca gttaaatttg tttacttttt aaagtacatt ctcagaagta acatgtactc 

70801 tcaattaagY aaaacacaac cagaattttg gaatttgatc tttaaaataa taaaaaaact 

70861 ttctctgcta tttacagcct tggttaaaat cccaacatct ggataccaga atctgagcag 

70921 caaaatccac ggagacacat gtgcaatttt atgcctgcaa gctcactatt atttagaacg 

70981 aaaccaacgt agtgcatcta cccttgtttc ttgctacaca gaatgtatcc agagtacaaa 

71041 ctaatgtaga ttattataaa actcagtgtg tagatattat atatatatac acatgcatac 

71101 atacacgcac atatataact aaaacaaaat cttcaaaaat aatacttttt ctttctgtat 

71161 gtgatgtgcc ctgatagttt ccactttact ccatcccatt tccttttgtt aatactggcc 

71221 ttgactcact tagacaaatg ggtcatgacc tgtagtttgg aaaacagggc tttccagcct 

71281 ggctttaaag aggctggtcc tcaggccagc agaaccagca gcacctgggg gtagtttgtg 

71341 gcgcacaatc tcaaggcctg ccctagacct actgaactag gatctgctct tagcaaggtc 

71401 cccaggtgac ttatggatgt attaaagttt gagaagccgt cctcctgcat gcaaaagtac 

71461 agaatgagta tctgtgatct gaactacatt taaatcgata atgcacctct taaactgtct 

71521 ttgttcaaag aacttctatc ctactgagct caaggagcaa cacagtttct agcaagatag 

71581 gtgcaaagaa aaattcactt tcaatttcat caactgcttc aagaggttgc agggactggc 

71641 tgacctggta tcccagctcc tgcaggaccg agagtgtggg aggtgggttg aggggaggta 

71701 ctgaattgac tgcaatagat ccatgagcat ataaaagtgt gcgtgtttgt tcttattcct 

71761 gggagaaaat atatagtatt cacctaattt taaaagagtt ttatggttta aaaagttaag 

71821 aagagctact aagccatgaa aagacgagga ggaagcttaa atgcacatta ctataccaga 

71881 taagaaaatc tgaaaaggct acatagtgta tgattccaac catatgacag tctggaaatg 

71941 gcaaaactat ggagatgtaa aaagatcagt ggttgtcagg gcctaggagg tcgggaggga 

72001 tgaacagaca cagctcagtg aatttgaggg gcagtgaaac taatctgtat gatattataa 

72061 tggtggatcc gtgtcatcac ctgtttgtcc aaacccacag aacatataac accaagggta 

72121 tacggtgttt atattaaagt ataaccctaa tgagaactct ggactttggg tgataatgat 

72181 gtgttgatgt aggttcagcc cttgttacac gtcagccacc catgtgggaa aatgttgata 

72241 gtggtRgaag ctatgtatgt ataggggcag gaattatatg agaacacttg tggtatatgc 
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72301 


tcaagtctgt 


tgtgaaccta 


aaactgctct 


aaaaaataaa 


gtctgtaggt 


atctatgaaa 


72361 


gcgcagacac 


atggtccgtg 


agagcacgtc 


ctctctgcta 




gtatggttac 


72421 


aatcatacct 


agaaaggaga 


gaaaataaag 


agtaagtgaa 


taccgacatg 




72481 


ccccagaata 


caacaaagtc 


cctgaatcac 


tcttgtttca 


ttgttgccat 


tgcttattca 


72541 


attcagtcaa 


ctcgtatatg 


tgaagcatgt 


gctacataca 


taacgtataa 


tgcaggatgc 


72601 


aaaaatgatt 


atgtgatcac 


ttcaagtaat 


ttataatcta 


gcaaacatat 


aaaacaggca 










ctttttcccc 






72721 


agcaaggatc 


cgaqqacaqa 


qaqqcaaaat 




aaaaaqacat 


qqaaccaaaa 






ctcqcqaqta 


aggtatcggg 


agagagtggt 


gtacaacgtg 


acaqqqacac 


72841 


ctgttgctcc 


caattqtaat 


cttttctgat 


tggtttatgc 


ccctttccaa 


ttactqtqtc 




catgctaatt 


atcaactatt 








aqaaaaaaaa 


72961 


aaactgctgc 


aaaqtqatqc 


tqtqttcatt 


ttqcacaqaa 


cccaqaaatc 


ctqatqaaqq 






aaataccaca 


^ ^^^^^^^^ 


cctqtqaatq 


ctqaatqact 


utgucggugg 


73081 


gttttacgat 


aaqatqtact 


ttcttttata 


tctattaaac 


aaqctataca 




73141 




aaqtaaaaaa 


aaqaataaaa 


taqataaaat 


tttctaaqtt 


aaaaqctaaq 




qqaaaaataa 


aaaqaattat 


caattqat t 


qatcaaaqaa 


aaat tttat 


ctcat t 


73261 


attaaaaaaa 


aaacatcata 




aataaaataa 


tcatcaaaaa 


caatct^cac 




aacaaatatK 


acataqctaa 


qqqaactat 


cattaaaata 


taat tatac 


aaatt ct a 


73381 


gaaactcact 


acaccttcta 


aatcatqtca 


aaaqaqttqt 


ttqcatataa 




73441 


cttcttctct 


ctctctYttt 


ttttttqttc 


atattcactg 


caaaqt 


ttt t 








aaatcacaat 


gatgtccagg 


tqaccaaaac 


ctqtaactaa 


73561 


tcctcagtcc 


tcatctcacc 


aaaacaacaa 


tttcttagcc 


cttctcctac 


ctccactoac 


73621 


ttctcttctc 


aactcctctg 


taattccccc 




aacctatata 


cattaaaqaq 


73681 




caaccctaat 


QCtcttctgc 


atcttgatcc 


ctaaataatc 


tcacctaqtc 


73741 


acccagcctc 


aaataacctt 


aggcttagaa 


atcacaaata 


tatctctcac 


ataacctcta 


73801 


gac teat ate 


aacctctcaa 


catcacaatt 


toaatatcca 


acacrgcatct 


ccaactcaac 






cccacttttg 




c tt 


t t ft 




73921 


cttttagagt 


aggtggcacc 


atcatatacc 


caqttacaaa 


caaaatcttt 


aaa^tcatcc 


73981 


ttgactctcc 


tacactgcat 


atctaatcca 


tcaqcaaatq 


ctgtcttttc 


caccttcaaa 


74041 


gtagatcccc 


agtctgatca 


tttctcacta 


cctccccttt 




tqcctqqcaa 










ctcttqgtcc 


ccctctagac 


tqttccccac 


74161 


acaacaacca 


acatggttct 


tttaqaactt 


caaccaatqt 


aaattatact 


tcaaqaaaqc 




tLLaaaaaat 




cacatctgct 


atqaaccttc 


caat ttct 


cctcccqcaq 


74281 


tqtqccaaat 


aaaatqqaaa 


tqcccttatc 


cttqcttaac 


aqaaqtctac 


aacacaqcct 


74341 


caccagcatg 


cctccaactc 


acatcctqca 


qctttaacct 




caqcccccaq 


74401 


tctccttgcc 


qttacttqaa 


aqtqccaqga 


qaqctcctqc 


caaaqqacct 


ttqcacatqc 


74461 




ctctqcaata 


ctcttccccq 


tqtqatcata 


cqqctcatqc 


tcccactccc 


74521 


ttcgggtctc 


agcttqaatq 


tcacctcctc 


cctatqacct 


tccctaacct 


ttcqtqtaaa 


74581 


attctacctc 


tcccttcctc 


agtctcttgc 


ttctcctqct 


ttattctqqa 


qaataccNaM 


74641 


cagcaccacc 


tgatttgtaa 


ctattccctt 


qaacacattc 


tatctcaacc 


acoaaaatqa 






caggcgggga 




tea ttc to 


ct taaccca 


gcaattctga 


74761 


atgattagat 


tgaaattgga 


tccaaaaata 


tcacaaataa 


qataacqtaa 


cttgttgata 


74821 


agtgtgtgaa 


gagtatgaag 




gctcacaagt 


ttccaaqtgc 




74881 


aaaactccat 


tgtcacccat 


caaatcagca 


aagattaaaa 


accacaataq 


aatqtqataa 


74941 








tatagatggg 


ca aatt 


t atct ta 


75001 


tcaatgatca 


ttctttcata 


ccaatacacc 


atgggatttt 


qcqatcaqaa 


q taaa aa 


75061 


acacagaagg 


aaaagaaggt 






cccataataa 


qtaqqacttc 


75121 




ttagtaagaa 


agagttgggt 


gtccccatgg 


gaaaatgcta 


atttaqtqtc 


75181 


ctgaaccact 


gtaccctcca 








cactqaqtcc 


75241 




ataatgatcc 


tcaaccccca 


attgcacaga 


cagggctaag 


tcaqaqacaq 


75301 


tgatgccaac 


cttgctgatg 








ccctaagMaa 


75361 




tccgattcct 


aagttaattt 


cctataaggt 


ggaagaagca 


acaagtgtca 


75421 


cttctcttct 


gggctctatg 


acaccaaata 








75481 


aaacccaagg 


ccaaataatg 


ctctctaaac 


ccttagaggt 


gtgactagga 


tattcacaga 


75541 


agccacgcaa 


aagacagcag 


ccttcaatga 


agaaacaaag 




catggttcct 


75601 


tttataaata 


tttatcgatt 


atctaaggct 


aggttaattg 


ttgataacta 


tcaattatct 


75661 


gagcaagata 


cgcagaaaca 


cttcaggtgc 


ttgaactgca 


gcatcctggg 


ctccaaccca 


75721 


aacttactgg 


agaagaagct 




ggaatctgca 


ttttaaccaa 


ctcctcaggg 


75781 


aaactgcccc 


agagggggtc 


taggctaacc 


atgcgcaact 




ctcctgggaa 


75841 


gtgattgtga 


atctcctcca 


aacccctctt 


acacttcctg 


attaagttgg 




75901 


tgaatgtcag 


cctcactgaa 


tcttaacgag 


gaatcgccag 


actggggggg 


gNattgtgca 


75961 


gcatgtctgg 


ggaacacttc 


actacttcac 


taaaccctgc 


tgcacacatg 


ctgaacaccg 


76021 


ttccaaaggc 


tacaggaact 


gagcaccgca 


ttggaatctg 


gccctggagg 


gtgcccagca 


76081 


tggcgaccct 


gaggggtRca 


ggaacccagt 


gagaaggaga 


aacagaaggt 


acagtccagc 
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76141 


tcgagaccac 


gcacataaag 


aaatctgtag 


gcagggtctc 


tattattgtg 


ccatattaaa 


76201 


tatcttgata 


ttttaaagca 


atattatttt 


tctgattata 




ttqtaqaaat 


76261 


tttaattgtg 


ccatcatcct 


gcagaaaaat 




aaaacqaaaa 


aqcaqctata 


76321 








atatactaac 


caoacta 


act atcaaa 


76381 


gctgatttat 


tgagaaaaaa 


taggcagcat 


atcaaagact 


tacatcaact 


teat 


76441 


cagactttgg 


gggcggggga 


actctgtctc 


ttttgagtat 


attctcaaaa 


t tt 


76501 


tcaccttcct 










gga ^^^^ 
gaaa gc a 


76561 


caactcaatg 


agtcttttgt 


cagccagtgg 




. ^ " 




76621 


tagcgatggg 


ataaatcatc 


accagccccg 


agacatcgcc 


taatcccttc 


ctcqcaaqqc 


76681 


agattcgggc 


gctttctgca 


tctgcttcgt 




aqaqctcqqc 


tcatctqcta 


76741 


ggctctcagg 


cttcacagtg 


acctcgcaca 


qaacqctqct 


acttctqaaa 


ctqttttaaa 


76801 


ctcttccgtc 






taaqtttaqc 




tea aaata 


76861 


tttagcgtta 


cttccaqaqc 


qqcaqccqtq 


aaacaatqtq 


qqaaaqaa t 




76921 


acttcttggg 


aaqtactqac 


cctqqqtcaa 


tttaattttt 


tccqtaaqat 


ttc ^^tt t 


76981 




cttccactqc 


tatat taac 


a tatctccc 


cttccact c 


t'^t'^t ^ 




aqtatctccc 


cttccactoc 


tatatqcacc 


aqt 


tatacaacat 


tata ^tt^'^ 


77101 


taatatgcta 


tattgtttaa 


aaatattaat 


ttatataat 


t 


t^catattat 


77161 


tatattgaca 


atacgtaatg 


tqatqtqata 


ataaqtctca 


ctaaatqcca 


qqca 


77221 


tgaatgctct 


aaactcatta 


qctaatttat 


tcctcataat 


tctataqaat 
9999 


aqatqcacta 




tttcattccc 




toaaoaaact 


aaaocaoct 




g g g 


77341 




atcatctaac 


cccaacacac 


tcataaaaat 


aaacaa t 
g g 






aagggaaaag 


gaagccaagg 


aataa 


ct 


acaaa cc g 


ggtaagctgg 


77461 


gtttttagtc 


cttccccaca 


tataaacaac 


a 




actttgtttc 


77521 


ccctglicatc 


cattgttggt 


gcctgatcct 


cacacaaaat 


cttacqqaac 




77581 


gattagctta 


gctgaattga 


gcctggacac 


ccttcatqaq 


ctqqaqqaca 
gg gg ca 


^aqqacqt^t 


77641 








etc cct ct 


acagg tgagt 


^ acct t^ 
ggacc a ga 


77701 


acccccagcc 


tcatcaagca 


cccaaoaatc 


acctqcctoc 


cacctgga 


agagacagca 






ccaqqaa at 

ctqtcR^ca 


cctct 


tt 

gaggc aca 


ggcagaagca 


gcggctgctg 


77821 


aacatagatg 




acaca 


tgataggggg 




gagcaggtga 


77881 


aggtgcagac 


tacaggcayc 


ccccctaaaa 




accccaqca^ 


cagctcccct 


77941 






ctqtqcccga 




aacaqqt ac 
gg gac 


cca qccc g 


78001 


agtggcagat 


qctqtqtrtt 








cccag gggg 


78061 


tacatggaag 


gtttgctaaa 


t ttt tt a 


at aataaac 


a cccaa'^t 


cagaagcag 


78121 




ttcctcatcq 


caaaaqqqtq 


cqqtqqaaaa 


actcacaaaq 


c aaggcagg 


78181 


aqqacacact 


caacttgqaa 


acqqc t 


cccaqcttt 




agcc gage 


78241 


tcaaqqaaqt 


cacttqacca 


qcaqtaqctc 


ccacctttcc 


tt c t 




78301 


qaccaaqtqa 


tctctaatqt 


qatqatttqc 


taaaatacaa 


^ tttt^^ 






taccttcctc 


tcaaaactaa 


cttcactat 




ttttt t^t 




78421 


ggtatttttg 


ttgcaaatag 


tacacacaMa 


acagga a a 


t 


*^ ^ttt 
g gcaa a 


78481 


aagaataaga 


agcaaacacc 


tttgcaKgta 








78541 


cctcagaagc 


cccttatgcc 


tttcccctaa 


aactctcccc 


tccctaacaa 


aataattcc^ 


78601 


atgctgagtt 


.atgtggtttt 


ttttccttat 


ttttcttttc 


aacacatatt 


acattt t t 


78661 


gaataatatg 


ctttttaatg 




''a^aaccatc 


g ggaa 


catatt'^t'^t 
ca a g a 


78721 


gtaatctgtg 




actcacatat 








78781 




tggagtttat 


ttttcactqc 


tqtata''tac 




a ta*^ t^ 


78841 


ggtttatgta 


tccacttttc 


tgtagatgga 


cactt^^ tt 


ttct tttt 


t ctcataca 


78901 




tgctcaggta 


caagggaata 


caqacccaqq 


qtqaqactqc 


tgqatcacaq 


78961 


gctaggcaca 


tcttcagctt 


tactaggtaa 


caccaaqcca 






79021 


actcatgctc 


ctgccagcat 


ggaggaccca 




taaaaacatc 


actqttatta 


79081 










caaccacttc 


tcaaaqcaac 


79141 


tagccagcat 


gtgccacaca 


ggcaagcacc 


cactccaatg 


tgatctatat 


ggttaggtat 


79201 


gtatgtaccc 


tggaaaaccc 


tgtgatgttg 


cttgaggagt 


atataacttt 


tttatqtatq 


79261 


taagtagcat 


gtgctacaaa 


gcgcgttctg 


ttttctgtat 


ttttcactcc 


actcagtgtt 


79321 


ttgtaaaaca 


tgctcatgtt 


gctccatagg 


tatcagagac 


aggatgatgt 


tttatcttta 


79381 


agatgggtac 


gccaccctca 


ttttatagat 


cacagtgata 


gctcacaatt 


gtttagtact 


79441 


cagtggggtc 


taaatgtact 


aactcgttta 


atctcccacg 


aaccctgcaa 


aacagatgct 


79501 


tccgtttcat 


ataaaatgag 


accaaggcac 


aaagaaacta 


aggaacttgt 


acaaaatcac 


79561 


caagttagta 


aacatcagag 


caggatttga 


attgagccag 


aatgtgttct 


tatccacgac 


79621 


atgaaactgg 


ataagaatgc 


aaagctcaga 


gaattttcat 


gactagcatc 


tctgggattc 


79681 


acagctggcg 


accacggagc 


agaactaata 


tacagatggt 


tcccctctca 


gtccagtcct 


79741 


cattccatcg 


gctccaaaat 


gtgtattatt 


ctatatgatt 


agctggtgaa 


caacaaaggc 


79801 


aattagttag 


tcctcttcag 


caaatgttag 


taaatgaaaa 


tafctatagag 


acagaactct 


79861 


ttttaggaaa 


tgcaatcatg 


tgaatttaaa 


aatgctttat 


cagaaataga 


attaatgaca 


79921 


aaaatattta 


taggatccta 


agaaaatgga 


tttgctattt 


tttttgtcgg 


ctgatgataa 
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gacagc 


ccacccc 




Katcttcata 


80041 


cacacctaac 


ata 


aga g gca 


ga a gg a 


'^tta^tataa 
g g a 


tttctgtaat 


80101 


aaaaaaaqta 


cttqqtcatq 


c g aa cc 




ggaggctgag 


gtgggaggat 


80161 


t ctt a cc 


ca a ttt a 




t 

ggcaaga ag 


tgagaccttg 




80221 


aaaaataaaa 


att 


ttt 

g g yg g 






t t 

gg yg 


80281 


gaggcagga 




agcc gggag 


gcagaggc g 


t 

ggg 




80341 




agcctgggca 


acagagcaag 


ac a g c 




aa aaaaa a 


80401 


aataataaat 


aaaaaatttt 




acc ggaacg 


ggg 


aaagggaaa 


80461 




tt^ttt^taa 


tcactattat 


cac gacaga 


gaagcagc a 


aaa aaacag 


80521 


cct^ctcatt 




t 








80581 


ttttatattc 


cata ttttc 
ca ag c 


aaccacccct 


ccttttttta 


tt 

g ccag 


^ '^^^^^^ ^ 
c cc aaac 


80641 


a a aaaagg 


a aaaagcca 


ttt 


cagaaaagag 


ca ca aaa 


gcagag 


80701 


gaa c c 




ggaa g cc 


c cc ccag 


gacagcc ac 


ggacaggac 


80761 


g ga gacac 


agag c ca 


cgggaa cac 




aagccac 


c g ggaaa 


80821 


ac caag cc 


t'^tt^^t ^ 






a gaaccaaa 


aacaaaa ct 


80881 


ccac a a 


a g aa ac 


a caaaagga 


acca ca g 


gcca a 


aagaac cc 


80941 


ccggccag 


gtgcggtggc 


cacgcc g 


aa cccagca 


c gggagg 


ccaaggcagg 


81001 


gga cacga 


ggtcaggaga 


gagacca 


cc ggc aac 


acgg gaaac 


cccg c c a 


81061 


c aaaaaaaa 


aaaaaaaaa 


gcaaaaaa 


agccaggcg 


ggtggcaggt 


gcc 9 ^9 


81121 


ccagc ac c 


aggaggctga 


ggcaggagaa 


ggcgggaac 


ccaggaggca 


gagg gcag 


81181 


caagccaaga 


tcgcgccact 




c gggcaaca 


aagcgagact 


ccatcaaaaa 


81241 


aaaaaaaaaa 


aaaaaagaac 


cc ^^^^ 


gaaa ca ag 


cagcgttcct 


cagcccaaag 


81301 


ccagaagaaa 




a gaaa 




ctcccatatc 




81361 


a gaa cag 


ggaagaaaa 


^ '^'^^^^^ 




caagaacccc 




81421 


gcagaaaaaa 


gaa acaaa 


aa aa a 




ttggtgggat 


ga agg 


81481 


a aaagga 


t^tt^tt^ 


aacacaa 


c acc eg 


cagccaacat 




81541 




c a gcc 




ccaaacaca 






81601 


^^^^^^^^^ 


aaaac c 




acatY^^^^ 




a a gaca 


81661 


""ataatt ta 


ga ga ga a 


ttatatatt 


^'^'^ ff'^'^f 




ccaa a ga 


81721 


qa aa g a 


gggaaaagaa 


a a a at t 
agagaga c 


t^tactttaa 


acaatt^ctt 


t^ct^a^at^ 


81781 






gaaaaagacc 


^'^ ^- 


acaa gc 


gc gaga g 


81841 




g age gc 




gccc age 


eac gacc 


caacc ggag 


81901 


ctcacaaaaa 




a aaaa caa 


tgtgtaaggg 


atctagggct 


gtgcaggacg 


81961 


gee g a 


aca g ac 


aagcag a a 


cttggtaaaa 


g ca cgcca 


c c agcc 


82021 


caa aaacc 


aggggcacaa 


gcac gcgg 


aaagccgcag 


ggacc c gc 


cc gaaagc 


82081 


gggg a g 


ccaaggtttc 


ccca g ga 


ag c gaaa 


a ggcc eg 


ggga gaga 


82141 


aagacc gac 




cccgacaccc 


g aaaggg c 


g gc gagg 


gga ag a 


82201 


aaagaggaaa 


t tt 


g gaga ag 


aggaaggcca 


''^'^^ 


c gcccc g 


82261 


ggaac gaa 






g aca g 


/^^^ 




82321 


aaaaccgccc 


t t t 


gg gagaca 


^ acagca 




gac 


82381 


ac ccgc ga 


t ttt 

ga g ggg 


ggagagaaa 




gcc a g gc 


^tttttt^^t 


82441 








^ 


ttttt^^t 


^ t t^*^ 


82501 


tt 




cc gc c cc 






aaa aa gaa 


82561 


aat^ata t'^ 


tt 

aaaac 




agaggccggt 






82621 




egg cccc g 


a^cc^act t 


cta^^aaatac 




^^"^^ t^tt^ 
c g g c a 


82681 


tttcttttct 




tcccacct^a 
cccacc ga 


c agaaa ac 


cc^ tt'^t 




82741 


tcacccctt 


ca^ata^t^a 




agcc aaaac 


ccacag ^g^ 


cct^Raat^^ 


82801 




'^tttctacca 


acacacattt 




^ ttttca 


ttt tat 


82861 


att tttcac 


ctttat aat 


ta cttttta 


aaacct^taa 


taetttctt^ 


ct acttcat 


82921 


catttccttc 


tt t 

gy g g 


gaggggtggt 


gtggaatgct 


aataaaat a 


atttca tt 


82981 


c t 






atttattgtc 


tcaacttccc 


tatqaqcttt 


83041 




atocaocttc 


atagtcaqca 




attaaaaaaa 


qtctttctcc 


83101 


cactttactt 


gtgtagcaca 


gacacactga 


tcaaatatgt 


tcccagcact 


tccgaagaaa 


83161 


ccttatcaca 


tccacaagag 


atttcctact 


tgcacttttt 


ttgataaaca 


caaccatcat 


83221 


agcccaacaa 


aacatcacac 


agaaatcaaa 


ttatagtgtc 


cccataagca 


tctgctcctc 


83281 


tatggccttg 


ggcaaaattt 


agaatcctcg 


acatccctag 


agttaccaca 


atctctgaaa 


83341 


gtgaaggagc 


acccacaaag 


ctcaaggaat 


gcattgaaac 


tcttattact 


tccaattcca 


83401 




aagtaaatgt 


gtgtgtatgt 


gagctgcaca 


tggtccagaa 


tgcctcacca 


83461 


aatgtattgt 


tttaaaatgt 


gtgtacgtcc 


atgtgtaatt 


gtatgtgtgg 


gattttgaaa 


83521 


ggctttttat 


attttattca 


aaaaggtaca 


cttttcaata 


tcacaaactt 


tgctttcaat 


83581 


tataagccct 


tcccattttc 


aattaatgtt 


ttaggcaagg 


aatatacatt 


agtaaaacca 


83641 


accacaccac 


ttagggaaaa 


tgaattagta 


tggggaaagt 


gaagaactgg 


ccatgcagtc 


83701 


aattactttg 


gccagcatag 


atatagttga 


actaaccagc 


acaggtgtag 


acagcctgtt 


- 83761 


tggataaggt 


agcttaagcc 


ccaatttttt 


ctgcagtctt 


tatttttgtc 


aaatatatga 



217 



PATENT 
Docket 524593008700 



83821 


aaactaccat 


ctcatttcct 


ccctgtagct 


attacccaca 


tgtatgtgta 


qtqtcaaaat 


83881 


aaatggccag 




gggacctccc 


ttggcacaag 




tcaccctctc 


83941 


acctccagag 


cc tcaggaag 




aaagcagcca 


qqqcaqqtqt 


qaqqtqatqq 


84001 


agcgctgagt 


gtcggcgagc 


ccagataaaa 




acatcqqaaa 


tccaatcatt 


84061 




tctcctgctc 


gctaacccca 


ggcaagtata 


aaqqaqatqa 


aattataaca 


84121 




gaatttggat 


gtgatttcat 




aacaaaattt 


tttctaaaaa 


84181 


ggliccagaca 


gtaaatialilit: 


tcagctggca 


ggccatatgg 


tctttatcaa 


acctactcaa 


84241 


ctgtgccatt 


atggagtgaa 






aaatgaatgc 




84301 


gttccaataa 




atggacactg 


aaatgtaaat 


ttcatgcaat 


ttacatgtgt 


84361 




gcttcttttg 








attttcttag 


84421 


ctcctgggct 


gtgttgactg 


acccctaatt 


aactqatatq 


tqqqctqcta 




84481 


cctaggcaca 


gaggaaggca 




cccaqccaaq 


acqqqactta 


ccttqatatq 


84541 


agctgctggg 


tccgggacag 


tctgaatcat 


qtccttcaqt 


aaqccaqccc 


atctaccaqc 


84601 


tgttcagaac 


Ntgacggcta 




qqaacaacat 


acataataat 


aaacaaYtac 




atttctgtgt 


tttatgttct 


aqcaqqacaq 


qaMacaattc 


accaatq ca 
ggca 


aaa aatcat 


84721 






tacctaaaqq 


catcctctaq 


guacgugiigg 


tccat^a'^^t 




qactaqataa 


ctcacttccc 


ttccattqqq 


aaaatcactt 




caa a aacc 


84841 


ctagtccqac 


aqqacaqaaq 


aatctaqctc 


aaqctaaccc 


acatcacat^ 


ca^catccac 


84901 


acatccttcc 


acgtgggcag 


catctaatqa 


qqqatccatt 




tqqq 


84961 


aacagggcat 


acttctctat 


catactgaga 


aaacaaaact 


ttqaaatccc 


accaaaactN 


85021 


atgcgatctg 




tcagcttYgt 


tattaaattc 


ttagtaaaca 


gcacagctac 


85081 


cattcattga 


gtacttcagY 




taaqcactaa 


aaactccaca 


cacattttct 


85141 


catttcattc 


caaccacgac 


tctgagaoct 


gtcattccca 


ttatccaact 




85201 






tttatccaaa 


atcacctaat 


taataaacaa 


cagaaccaaa 




cttcaaatca 


aattaaacta 


tctgactcca 


gagcccaaaa 


aaaattt t 


ttgacttaat 


85321 


taacMaatat 


caacccacat 


tttaaaacraa 


cattttaaaa 


caccacttaa 




85381 


tttaaaaato 


cctataatcc 


aaaaattttc 




tacttcatcc 


aatatattcc 


85441 


tttattcaaa 


tYataaaqca 


qcatqttcta 


accactatct 


CTtqccctccc 


cttccaccqa 


85501 




aacctccctt 


tgctgtcaca 


qqccccaqqa 


ctqacctcta 


caaataaact 


85561 


cagcagggtg 


ccttqcctcc 


tqqccacctq 


tcaqqqttqq 


ttqacatqoc 


cccctctaaa 


85621 


qaqcaqqqqt 




qaqtatctct 


tcccctcacc 


ttcccaccac 


cqacacctac 




CQcccaacct 


ccct cqc 


unggiiraTigy 


ttctqactqc 


attc 


tagggggacc 


85741 


tctcccgtca 


tggggcccct 




tccaqcctcc 


a tacct t 




85801 


tctctggtgc 


tctqcaqatc 


tcaqccaaqc 


catcaqttct 




taaaaatgcc 


85861 




tacattttta 


gaagccatca 


^^^^^^ 


atcacctqta 


gttttttgtt 


85921 


tgttttctcc 


tttacRatgg 


ctctttgcag 


taacaaatgc 


aaaataaaaa 




85981 








tgtctttcaa 


atttqctact 


gagcgcatta 


86041 


ctgtagtgac 


aaqtqaaaqc 


attcacaqqa 




cattcacatc 


tgtcctttct 


86101 


gacctctcac 


ggtacaggca 


cacccattcc 


aacccaatYg 


cctctcatga 




86161 


cctccagatc 


tgcatgctcc 


acatctgtct 


tcagtctccc 


tggaggcctc 


ccaagattgg 


86221 


cggagcttgg 


ct caeca aga 


agatgacatig 


ctgggaacct 


taggcatttt 


ttttctataa 


86281 


ccaaatttct 


aggtcttgga 










86341 


tccctctggg 




tactaaaaqt 


acaqaqqaaa 


tqaaatacaq 


tcctqaaatt 


86401 


gaaccgggaa 


qcaqatqaaq 


qtqaccaqqc 


tcctqqtqct 


ctcatqaaaq 


aaqaqqctqq 


86461 


gagaggatgg 


ggcctggctg 


aaggtttctg 


ggagcaYagg 


aatcctcaac 


tccccttatc 


86521 


ctaagagcag 


caggtgccac 


aggaaaacac 


cactctgctc 


ctggggtcat 


tcaqcccqqa 


86581 


aggggagcgt 










aaccctqctc 


86641 


tggggagcac 


atggctattt 


ccgtatgtct 


qqacattctc 


tctqqactaq 


ttcactccqc 


86701 


acagttcctc 


cctgcctgca 




gacgctcaca 


ccagggactg 


aagctcgtcc 


86761 




aaacccaggc 


tgacctgtgg 


ctgcctctga 


cctgcagaac 


acagcagagg 


86821 


gaaggtttca 


ggagtcccag 


gctcaggcct 


t tatggaagt 


gggagcttcc 


ccttcctgcc 


86881 


tcctgaaacc 


aagtccccat 


accctcagca 


gcacagcaca 


tggcagagga 




86941 


agtggggcag 


ccagcaccaa 


cgccaggccc 


gggagggagt 


tgtcctggac 


agtccagaga 


87001 


actgactctc 


agatgaccac 


ggcccctgcc 


gacatcaggg 


aggcagcttc 


cagctgagtc 


87061 


cagtcagccc 


acagaatcat 


gaccaataat 


aaatgggttg 


ttgtactaag 


ctgctaggat 


87121 


ttggggtagt 


ttattacaca 


cacaggtcat 


tgaaacaccc 


tgtaagtgct 


ctagaggcac 


87181 


ggagaagata 


gagggtaaat 


agcttgtttc 


actgtcacta 


tggataagta 


caaagRtggt 


87241 


tccaacaYtc 


aactcctcct 


gtacccatgc 


tttttgccac 


ataactttgt 


gattcttccc 


87301 


attaaagagg 


tagaattact 




tatttttggg 


cggcctatat 


gatctgcttc 


87361 


agccaacagg 


cagtagcaga 


aagacaagta 


tgacagatcc 


tagctgaagc 


ctcaggaggc 


87421 


aggacatgtt 


cccctttgct 


tttctgtaac 


tttccaccag 


catgggaaca 


tggccagatg 


87481 


catgcgggag 


gatgagaggt 


acctgaggca 


gagtcaccca 


ccccaatcac 


ccSaccaaga 


87541 


ctacgtttga 


tcatcccata 


gacaaccaaa 


accccagaca 


tgggagggag 


gcaaccagac 


87601 


cagccgctag 


ccaagtccaa 


ccagaatggc 


caacccacag 


gctcatggct 


gaataagtgc 
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87 661 ttactgcttg aagccactga aatttgggga atttgttaca cagcattttt gtgacaatag 

87721 attactgaca tacactttgc aatgtacaat cagattaccg agatcttcgt taaaattttt 

87781 ctagccagtg ataaaaggag actcaaacag ggcatttcaa aggaagccac ttcatcccaa 

87841 cacagagaca taatatttcc actcacctac gagacatgat ctcaaaccct aatcctgccc 

87 901 tcaaaatggg tYtggaaact gaaggataac caagaaacct ccaccagaag cctccatggg 

87 961 attcccagag cccttcccag gagcaactga attctcactt cacatgatgc cctgccagcc 

88021 tcccctggat tccaaaaatc aaaacatcct tggatatcca agaccgcagc aacatcatag 

88081 aagttacagc agccactcgg tctgtaaMca accaagacat aaactaaatg agcaactgtt 

88141 tgagtctctt gtttcttatg gggaagtttt gagtactcag gttgcatcaa gtataaaatg 

88201 acagggaagg gaatagatag agattccaga tgcaacacag tcctgatcct tgagggatcg 

88261 agatgggaaa cagatctgtt tcatacagat atgaagcggt cagtcctgag agcaggcagt 

88321 tagtggaggt gcccagtgag gcgtgtggat atacctagag gaaagagaga gctgattcat 

88381 gcagagacaa gagggtgcca ccaaggagcc ttgatgaaca agaagcatga ggactggagg 

88441 aaagagggtg aaaggcatga agcagggaac ctaagtttct cccagctctg ccaMattaga 

88501 gaccaagggc gtatgagctt ctcacaccgc caccaccaga gtgatcacac accaaaggga 

88561 ttcactcagc acgcagcgtg atggcactgc caccctccac accagctcag cgaaaaccca 

88621 acggacctaa agaccaggag aataaaagct caaccaccct cccctgcctc ctcaacccac 

88681 acggatctct gactcttttt ttttttttga gacggagtct ctctctgttg cccaggctgg 

88741 agtgcagtgg tacgatctcg gctcactgca acctctgcct cccgggttca caccattctc 

88801 ctgcctcagc ctcccaagta gctgggacta cagccgccca ccaccacgcc tggctaattt 

88861 tttgtatttt tagtagagac gggatttcac cgtgttagcc aggatggtat cgatctcctg 

8 8 921 acctcttgat ccacccgcct cggcctccca aagcactggg attacaggcg tgagccacca 

88981 cgcccggcta attttttgta tttttagtag agacggggtt tcaccgtgtc ggccagatgg 

89041 tctcgatctc ctgacctcgt gatccaccca cgttggcctc ccaaagtgct gggattacag 

89101 gcgtgagcca ccacacctgg ccctctctga ctctttttgc agccatttga gtgcctgttc 

89161 ttttttacca tttcagaggt attttatgga ttttttaaag agtacttcta atttactttt 

89221 gtgttcctaa tggtcatttc tcaaatatgt agtagttggt aaccaaacaa actaaaacca 

89281 tactaaaaat ttttagtagt tggtgatagt ttcatgtcac tgtaggaacc tttttatttt 

89341 ctattttacc tgcctttgaa cccttgcaga acttcactct atgttgattg aaaatcattt 

89401 ctttaatact tgtttcttgc aataaactca tttgctaagg tatatacttc catatgattt 

89461 tgtgtttcat gacccagcta tattttatta ttgggtatct taaagaaaca aaaaaggcct 

89521 taacaataac agaattctac ttgtgtcact tcLtLgLLLL LgLLLLcaaa atgccaatgt 

89581 cttactaaat agaaacaMac ttgatttgaa ttacgtaagc tctaaccaca taaaaaagat 

89641 ggacLggLac ccaaggLaga agggaaaata agcattttgt aaLtatggag gcagtaLLgc 

89701 aaaatggtta agagaaggcc taaatatttg cagggctggt gcaagaaaat agatggaagt 

89761 ttgagacccc tcgaccctac ccctcctctc ttctcaaacc caattgtgtt ctgcgggtta 

89821 ggggatgcat atagtcacgt ggacatcccg atccacatgg ccacactcca ttcagcaatg 

89881 ctccctgctc atctttcaga cccagggaaa cacacacaag atcttcccaa gaagatgaac 

89941 tccaggaaac aggctggtgc agcccaaaag atgctcagtc aagtttggaa aggcactctg 

90001 ggtgccccag cacctggggc atgtcctgga atggggatgg gttcagctgg ggagggcagg 

90061 aatggggcct agaaggcata ggcaggacgc tgggctgggg gcagtgctcg atatgagcac 

90121 agtgcagagc cagatgaggg atggaaatca gatctgccac ttactccttg gaaacctata 

90181 aatgatcact ttatcctcca agagccttag ttacacattt gcacaatgag gataWtatta 

90241 ctgtagtata ataagaactc catttgccct tgtctctcgt ttctggcatg gagctcctaa 

90301 aactctggga atttcccaag tgatgggagc atcttttgtt ctaagaaggc agcaggttgg 

90361 tgagtcccta gatagcttca ggatgggggc taattgcaag aaagagtgag tcttgattag 

90421 atgctcagaa ccttcagccc catttcccaa cctctcagga gaggagagga gctagaaatt 

90481 gagttgatcc ccagtggcca gtgatttaat caatcctgcc tatgtaacaa aatctccata 

90541 aaataccctg acggggttag gacagcttct gggctggtga gcatattggt atgaagagtg 

90601 tggtgcaacc caatgctaaa gggacagagg cggccaggcg cggtggctca cgcctgtaat 

90 661 cccagcacct tgggaggccg aggagggtgg atcacgaggt caggagatcg agaccatcct 

90721 ggttaacacg gtgaaacccc atctctactg aaaatacaaa agattagccg ggcgtggtgg 

90781 tggacgcctg tagtcccagc tgctcaggag gctgaggcag gagaatggcg tgaacccggg 

90841 aggaggaggt tgcagtgagc cgagatcgtg ccactgcact ccagcctggg caacagagcg 

90901 agactccatc tcacaaaaaa aaaaaaaaaa aaaaaagggc agaggctcct gtacttggaa 

90961 tccttccaga ccttgccctg tgcacctctt catctggttg ttcattcata tgcttaataa 

91021 taaactgtaa tcataagagc atagcgtttt cctgagttct agcaagttgt tgcacctaaa 

91081 aggggggttg tgggaacccc caatgttgta gccatgtcag acagagtgta ggtaacctgg 

91141 ggacccaata cttgtaactg gcatctgaag tgaggacggt gttggggagc tcagccctta 

91201 aacccatgtg ctaacttccg gtagtcagtg tcagaactga attgaactgt tggacaccca 

91261 gttggtatca gtgttgaatg actggtKggt actggcgaag acatgatgta tttggtgttg 

91321 ggaaacatga ataaaaaatt tgtaaaacct ccaaacgaYg ttcagcaatt cacaaaggtg 

91381 tcagtgcaag agttggcctg tatctattct aactggtcag tatgcaagtt gaaccattgt 

91441 ttgattatct cctctgtcta caccatcagg cttttgtgag gattaaacta aattatgcct 
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91501 cagaaagaaa actctctcaa ggctaggcac ataatcaaca ctaaacaagt ggtaatcatt 

91561 ataattaaga aagtaaaaat atttcaaatg gacgattttg atactgtata aaatgttgca 

91621 actttttttt ttttgagacg gagtcttgct ctgtcgccca ggctggagcg cagtggcgca 

91681 atcttggctc actgcaacct ccacctcccg ggttccactg attctcctgc ctcagcctcc 

91741 cgagcagctg ggattacagg cacgtgctac catggctggc taattaaaat gttgcaactt 

91801 tttaggaaaa ataatcctca ataaataaat tgctaatttt cataagagcc ctttgtcaac 

91861 tgactaaaaa catccttaat ttagaaatgt attttttagt tttaagataa ctagtcatta 

91921 agtgaaaaca ttgtaggcaa aatggctcta agcaatttag tattcatagg aatgaatcca 

91981 ttccttgaaa gcattggtcg ctgggtgcac ttgtcctgga tgagcagata ttaggcaaat 

92041 cagtctttca ccagcaaacc ctgccataag aactgcacag gcctgattcc tgccaagcac 

92101 accaggcccc tgactggggc agaggcctcc cttatggggc aggaagaagg aggcagggag 

92161 aaacataggg cccatgattt aaggaggctc tccctctcag gctcatgcaa gtgcaggatc 

92221 agtacctaaa ggggagcacc gcccgaaatc ctgtgcccca gtcaccttaa cccagtccag 

92281 ccctggctcc tgcactccaa agctaagcag ggaataccag cgggagacag aagacagcat 

92341 gagaaactga atgcatctcc cacatttatc taaaaataag aaaMtggact tcttcagtag 

92401 gaaatggcty atctttccaa atgaataaaa atacttatag ctatgggttg aattgtatca 

92461 attctctccc ccagttcatg cactggagta ctaacccccg gtccctcggg atgtgatgat 

92521 agagaggtag ggcctttaca gaggcaatca aattaaagtg aggtcattag tgacctaacc 

92581 caatgtgact gatccactta caaaaaaagg gacaagattt aaagacagac ctgtatagaa 

92641 ggacgatggt gtgaagacac agggaaaaga caaaagaggc ctggaacaga ttttcctcaa 

92701 agccctcaga gggaaccact tgggctgaca ccttgatctt ggacatccag cctccggaac 

927 61 ggtgagatga tgaatttctg ttgttttaag ctacccagtt catggcagtt tgttacaaca 

92821 gcactgggaa aggaaatcac ctctatggag ggacttacga aagtcactta actaatgcag 

92881 attagaggcc tcacagtcaR ggcatggcct caagagccac atccatttta attgtaccat 

92941 ttctcactga actctgaccc tggtcaaggc atgtctcagg ctccatggac tactattcct 

93001 tccactgatc gaccctggct tcattctggc aagggaggga ctcatacatt ctgcggcctg 

93061 catgagtgtc atgtgagcac ccccataacc ttgatccctg cggtgcttct ccaggggagc 

93121 cctagcattc cagctggcct catgggcatg ccttagagag agggagctga ccctctcctt 

93181 cttcaactgg attttatttt tattttttgg catgttttgt gaatacaaag attcactgcc 

93241 tggqtgtgcc ccttgaaaga aggtttcaat tgcattcagg tgaaataacc acaaaatgac 

93301 tgtaagccaa atgtagtagt atgatagaaa aatggaaata tcgtaaaagc accagtctgt 

93361 agacacctag cttctggttc ctctggaaaa gggagaagag agtagacagt gatggagaaa 

93421 agaggggctg tgaagactat agaactgaga aggcagtaat tctaaaatct ccacactgta 

93481 gggccaggaa caggcagtgc tgggcaatgg ggcatcttat ctcagggaag tcaagggtgt 

93541 catgtagaac caacataatt catccaagta cagcaaagaa tccagaaatg gtctgtggag 

93601 gtttggaacc atgaaaatca aataggaaag atacaaaata tttacatatc cacatttttt 

93661 aaaaggggtc ctgactaggc agcataagat tatttgggtt ctgttcattt tacctttagt 

93721 tgcccttggt tctgccatct tttttctctg tgagtcattt gtcttgcttt tggtcaacac 

93781 ggctttcctc gggtctccaa agatcctgga ataacctgaa tataagttga agaatattaa 

93841 accttatttt gttttcaaat attaatacca gggatcaact gtacatgtac cgacaaaacc 

93901 tatttcttaa acattatttt aaaatatcct aaaaggggaa agaacttaaa tccttcccct 

93961 taaaaaaaga ggggacaaca ttcctctcct ctgagggaag cccagcatta tgaagcaaaa 

94021 atcgggagtg aaagcaaggc caagcctaac tggcatgctg aagcacacag atgcttttgc 

94081 acagaaaata ttttgaaaat gttttctttt ccttatagcc cccagctttg gtaataatgg 

94141 cttccctaga tcattctgtt agcactttct gctgctcaga acagggtagt gggatttaga 

94201 atttagaaaa catcccgaga actgtaatta agcaattgtc atagtcccag aaaagaaaca 

94261 acccttttac tttgtaaatg ctaggaggat ctgtattttt taaaatctta gaacactaac 

94321 tagctaaaaa gatagaaaga aaaaaatctt tcttacaata tgcctgtagc acggctccaa 

94381 aatccagtcc ctctctctct tctttttttt tttttttttt ttttttttag atacaggaat 

94441 ttcttacaca actttccaag taaaatattc aatccccaaa taatatattt agtaRaatcc 

94 501 ttacaacatg aRaaattgta aatggcatct gtttccgttg aaaatatttt taaattagaa 

94 561 aacatcttaa gcactgcatc aaatcaatat ttgtaatggg cagaattatt tctaaatggt 

94 621 tccacaggcc agttgacatt cccactctac ttaggaagcc cccagccaac tccacctgaa 

94 681 gtcttcgaaa cctatgcata gaggcaaatt gcccagatgt cacagaaaag catttgttgt 

94741 gatacagaaa ataaaacaca ttactggggg aaaaaaatca gggtactaaa ccacgtacat 

94 801 aatatcatcc caatattgca caaagaaaaa aaaataagtg gatgaaataa aaga.gggaag 

94 861 gcaggaaagg aaagaggatg acctgagaaa agttatcaaa atggttaaca gtcatggtct 

94921 ttggctttcc agtgtgaaat aggagttctt attttcctct ttttaaaata tttttaaaat 

94981 tatttatttt ttaaaccaac taataaaagt aatatatatt taccgtgtac aacattatgt 

95041 tttgaaatat atatatatat atatatatat atatatatat atatatatat atatatatat 

95101 atatacatgt tggaatggct aaagtaagct aattaacatc tgcattacct cacatactta 

95161 tttttttgtg gtgcgaacac ttgaaatcta ctcttagtga ttttcaagaa tataatgcat 

95221 tgttattaac tacagtcaat agatcatagg tctattcaag ttcaagtcac agatctcttg 

95281 aaattactcc tcctgtctaa ctgaaatttt gtgtcctttg accaacatct ccccagttcc 
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aca cac aa 




aa gcaaa c 


gaaatcatga 


gag g ca 


95401 


aa^t''tt 


agaa ggc 


ca a caaa 


aagacaag g 


ttggtgggga 


g ggaaaaa 


95461 


gggaac c 


gcccaccg 


gg gggaacg 


aga cag a 


cagtcattat 


ggaaaacagg 


95521 


ggagg c 


c caaaaaca 


caaaga caa 


acaaaaca 


aaa ac a a 


g acaa 


95581 




ac caa g 


gaga a 


caaa caaaa 


ac acatga 


ttgctggcaa 


95641 


ccc c c g 


gaga aga c 


caaagaaaa 


ggaaaa gaa 


a egg atgt 


caaagaggta 


95701 


ccagccc cc 


caaa ca 


acagca a 


cacaacaga 


caaggcaligg 


catcaaccta 




ag g ^^^^ 


aaca aaacg 


ccca aaaga 


aaag gg gc 


acatacacaa 


tggaacacta 


95821 




agaagagaaa 


cc g ca 


g gacaaca 


gaacaaacc 


tggagaactt 


95881 


t t t 
a gc aaa 


gaaa aagcc 


aggcacagaa 


^^^"^ 


gcatcctct 


cacttataca 


95941 


agca c ga 


aaag gaac 


"^^^ 


c atag 


ttgtgtat 


tttccaaatt 


96001 


ataatcct^^ 


aaaa ^ 


c 


caggggagaa 


aactaaggaa 


taatcatttt 


96061 




aca gc 


aaagca a 


cac g a 


a gta tc t 


ttcccateca 


96121 


g ca aca g 


ggcccaagga 


gc ggcc c 


cc gc cggc 


agcctgatgt 


tcaaggggat 


96181 




gacgg gcag 


aga catgc 


tttgcaaagc 


tgtgcaaatc 


tttttttttt 


96241 


ttttttgaga 


cggagtctcg 


ctctgtcgcc 


caggctggag 


tgcagtggca 


cgatcttggc 




cac gcaac 


c ccgcc cc 


caggttcaca 


ccattctcct 


gcctcagcet 


cccgagtage 






ggcgcccgcc 


accacacccg 


gctaattttt 


gtattttagt 


agagaegggg 






agccagga 


cggtctcaat 


ctcctgacct 


cgtgatccae 


ecgcctcggc 




caaagtgctg 


ggattacagg 


cgtgagccac 


ggcgcccggc 


caaagctgtg 


eaaatettga 


qfiR4i 


gtitiaataaac 


a'tgt'tcccca 


aatccaaaag 


ccaatattga 


ctattgcttg 


gttaactgtt 


96601 


"^^^^"^^^^ 


aagtggttaa 




ggtttccaat 


't't ctgc aaag 


ggaataaaag 


96661 


aaaagc 


g gaac aag 




ctatgctatt 


ctatagtagg 


tttttatgcc 


96721 




c ggc gaa 


agga aa 




tgagacattg 


ttgggcctga 


96781 


aaagccc 


agag caagc 


ccagagagga 


c c taaa 


tatgtgcaat 


gggtattttt 


96841 


aaa agca 


a ggcg a 


ttaatatttc 


atttatcctg 


agagcattta 


cctgacaacg 




aaa a g 


g aacacagc 


tgagaagcca 


caaaggatcc 


aaaaagatgt 


cacgaagtcc 


96961 


acgaagagaa 


gc gccag 


agaaaggcac 


actaccagcc 


gcgtgagctg 


gcctcatcaa 


97021 


agcaaaaaaq 


cagaaaa t 


a g aaacca 


gactcaagaa 


cteaaaaata 


cagtcgtttg 


97081 


agaaca gg 


aagag gca 


aac gctcac 




tcacccctgg 


tccagcaacg 


97141 


gcagaaagcc 


aagg caagg 


ag caca ga 




caecccccac 


caccaccaec 


97201 


t t *t 


ccc ^"^^^"^ 


ggccagtggg 




tgacccctgg 


gattcagtat 


97261 


cag cagg 


eg g c 


gtatggggag 


gtagaagaag 


gcaccagate 


ateaggaaat 


97321 


c ggac gga 


a ggccaaa 


gctgaacaca 


c gag acc 


tgaagggtaa 


gcttaaggtg 


97381 


caaagcagc 


ca gg gagg 


cagaaagaac 


c gage g 


gggccccaag 


tttatgctga 


97441 




gccaacaaaa 


cc ccc cag 


cc cag g 


ctcatctgaa 


aactgtgagt 


97501 


t 


gac ccagag 




?f 


gaaggggtcg 


ctgatggtcc 


97561 


aga ggacag 


a gcaaagca 


ctgc tctg 


tttttgtctt 


tgttttgata 


caggattctc 


97621 


c g ca cc 


aggc gg gt 


gcagtggtgt 


caa t age tea 


ctgcagcctg 


gaccctccct 


97681 


ggc caag g 


a cc cccac 




caactagcte 


agactatagg 


aaaaagagtc 


97741 




a gcc ggc 




aaaaaaNttt 


tggagacatg 


agate tcact 


97801 








cctcaagcaa 


tcctcctctc 


tcagcctccc 


97861 


aaagtgcttg 


gattagaggg 




atgcccagca 


aaggcattgt 




97921 


aatgaatgac 


caccttgttc 


cccaaaacac 


tagacaagag 




tacgcacaga 


97981 


tctccctggc 


caccacgtgt 


aagcagcagt 


gaatattcca 


taggtggaac 


tggctactgt 


98041 


gaagaaagaa 


aactccagaa 




cactccaaga 


agtagcctgg 


accccacgtg 


98101 


cgtgacagca 


gcctggggag 


aaggctccac 


tcccaccagc 


aatgccaagg 


acagagcagc 


98161 


agcagccgtc 


catccccaca 


gttatggtga 




agagcagcct 


gctttttcca 


98221 


tgtatagcca 


cacacaggtg 


gcaatggcag 


cagacgtaac 


tggctgccct 




98281 


tggaaggaaa 


cataattttg 


cttgggtgtt 


ctgeetgccc 


ccacatgacc 


caYggaggat 


98341 


gaccagcccc 


aacaccagtg 


taaatcctat 


cactgggcac 


catctcttct 


caccacggat 


98401 


tggtttaggg 


aagggcacat 


gatataattc 


tggtcagtct 


aacactcggg 


gaagcgagct 


98461 


agaggacttc 


taggaaaagt 


gttggctgat 


gaagaaaggc 


catagagtag 


tagtcctgtc 


98521 


tgtgtggcac 


ttggaactgt 


gagagccatc 
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[0269] Following are cDNA sequences for PSMBl (SEQ ID NO: 5), TBP (SEQ ID NO: 6), PDCD2 
(SEQ ID NO: 7 and 8), ELP3 (SEQ ID NO: 9), CHDCl (SEQ ID NO: 10), and ERG (SEQ ID NO: 11 
and 12). 

PSMBl cDNA sequence fSEO ID NO: 5^ 

NM_002793 Homo sapiens proteasome (prosome, macropain) subunit, beta type, 1 (PSMBl), mRNA 

1 aaggcagcca tctcgccgtg agacagcaag tgtcgcgcag ccgtgcgatg ttgtcctcta 

61 cagccatgta ttcggctcct ggcagagact tggggatgga accgcacaga gccgcgggcc 

121 ctttgcagct gcgattttcg ccctacgttt tcaacggagg tactatactg gcaattgctg 

181 gagaagattt tgcaattgtt gcttctgata ctcgattgag tgaagggttt tcaattcata 

241 cgcgggatag ccccaaatgt tacaaattaa cagacaaaac agtcattgga tgcagcggtt 

301 ttcatggaga ctgtcttacg ctgacaaaga ttattgaagc aagactaaag atgtataagc 

361 attccaataa taaggccatg actacggggg caattgctgc aatgctgtct acaatcctgt 

421 attcaaggcg cttctttcca tactatgttt acaacatcat cggtggactt gatgaagaag 

481 gaaagggggc tgtatacagc tttgatccag tagggtctta ccagagagac tccttcaagg 

541 ctggaggctc agcaagtgcc atgctacagc ccctgcttga caaccaggtt ggttttaaga 

601 acatgcagaa tgtggagcat gttccgctgt ccttggacag agccatgcgg ctggtgaaag 

661 atgtcttcat ttctgcggct gagagagatg tgtacactgg ggacgcactc cggatctgca 

721 tagtgaccaa agagggcatc agggaggaaa ctgtttcctt aaggaaggac tgatctgtgt 

781 gctcttatca ccaatcagtt cagacctggt tgattttgta ctttggaact gtaccttgga 

841 tggttttgtt tattaaaaga gaaacctgaa gt 



TBP cDNA sequence fSEO TD NO: 6^ 



NM_003 194 Homo sapiens TATA box binding protein (TBP), mRNA 



1 


ggttcgctgt 


ggcgggcgcc 


tgggccgccg 


gctgtttaac 


ttcgcttccg 


ctggcccata 


61 


gtgatctttg 


cagtgaccca 


gcagcatcac 


tgtttcttgg 


cgtgtgaaga 


taacccaagg 


121 


aattgaggaa 


gttgctgaga 


agagtgtgct 


ggagatgctc 


taggaaaaaa 


ttgaatagtg 


181 


agacgagttc 


cagcgcaagg 


gtttctggtt 


tgccaagaag 


aaagtgaaca 


tcatggatca 


241 


gaacaacagc 


ctgccacctt 


acgctcaggg 




cctcagggtg 


ccatgactcc 


301 


cggaatccct 




caatgatgcc 


ttatggcact 


ggactgaccc 


cacagcctat 


361 


tcagaacacc 


aatagtctgt 


ctattttgga 




aggcagcagc 


agcaacaaca 


421 


acagcagcag 


cagcagcagc 


agcagcagca 


gcagcagcag 


cagcagcagc 


agcagcagca 


481 


gcagcagcag 


cagcagcagc 


agcagcagca 


gcaacaggca 


gtggcagctg 


cagccgttca 


541 


gcagtcaacg 


tcccagcagg 


caacacaggg 


aacctcaggc 


caggcaccac 


agctcttcca 


601 


ctcacagact 


ctcacaactg 




gggcaccact 


ccactgtatc 




661 


gactcccatg 


acccccatca 


ctcctgccac 


gccagcttcg 


gagagttctg 


ggattgtacc 


721 


gcagctgcaa 


aatattgtat 


ccacagtgaa 


tcttggttgt 


aaacttgacc 


taaagaccat 


781 


tgcacttcgt 


gcccgaaacg 


ccgaatataa 


tcccaagcgg 


tttgctgcgg 


taatcatgag 


841 


gataagagag 


ccacgaacca 


cggcactgat 


tttcagttct 


gggaaaatgg 


tgtgcacagg 


901 


agccaagagt 


gaagaacagt 


ccagactggc 


agcaagaaaa 


tatgctagag 


ttgtacagaa 


961 


gttgggtttt 


ccagctaagt 


tcttggactt 


caagattcag 


aatatggtgg 


ggagctgtga 


1021 


tgtgaagttt 


cctataaggt 


tagaaggcct 


tgtgctcacc 




ttagtagtta 


1081 


tgagccagag 






cagaatgatc 


aaacccagaa 


ttgttctcct 


1141 


tatttttgtt 


tctggaaaag 


ttgtattaac 


aggtgctaaa 


gtcagagcag 


aaatttatga 


1201 


agcatttgaa 




ctattctaaa 


gggattcagg 


aagacgacgt 




1261 


atgtaccctt 




cccccttctt 


tttttttttt 




agtttgtttt 


1321 


ggtaccttta 


aatggtggtg 


ttgtgagaag 


atggatgttg 


agttgcaggg 


tgtggcacca 


1381 


ggtgatgccc 


ttctgtaagt 


gcccaccgcg 


ggatgccggg 


aaggggcatt 


atttgtgcac 


1441 


tgagaacacc 


gcgcagcgtg 


actgtgagtt 




tgctgctatc 


tgggcagcgc 


1501 


tgcccattta 


tttatatgta 


gattttaaac 


actgctgttg 


acaagttggt 


ttgagggaga 


1561 


aaactttaag 


tgttaaagcc 


acctctataa 


ttgattggac 


tttttaattt 


taatgttttt 


1621 


ccccatgaac 


cacagt:tttt 


atatttctac 


cagaaaagta 


aaaatctttt 


ttaaaagtgt 


1681 


tgtttttcta 


atttataact 


cctaggggtt 




cagacacatt 


ccacctctcc 


1741 


agtattgcag 


gacagaatat 


atgtgttaat 


gaaaatgaat 


ggctgtacat 


atttttttct 
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18 01 ttcttcagag tactctgtac aataaatgca gtttataaaa gtgttaaaaa aaaaaaaaaa 
18 61 aaaaaaa 



PDCD2 cDNA sequence 1 rSEO ID NO: 7) 

NM_002598 Homo sapiens programmed cell death 2 (PDCD2), transcript variant 1, mRNA 

1 tcttgccttc cggcccggcg cccgatttcc gccttccgac ccagctgtgg gctgcgcccc 
61 acgccagccc gcgccccgca tggctgccgc cggggccagg cctgtggagc tgggcttcgc 
121 cgagtcggcg ccggcgtggc gactgcgcag cgagcagttc cccagcaagg tgggcgggcg 
181 gccggcatgg ctgggcgcgg ccgggctgcc ggggccccag gccctggcct gcgagctgtg 
241 cggccgcccg ctctccttcc tgctgcaggt gtatgcgccg ctgcctggcc gcccggacgc 
301 cttccaccgc tgcatcttcc tcttctgctg ccgcgagcag ccgtgctgtg ccggcctgcg 
361 agtttttagg aatcaactac ccaggaaaaa cgatttttac tcatatgagc caccttctga 
421 gaatcctccc ccagaaacag gagaatcagt gtgtctccag cttaagtctg gtgctcatct 
4 81 ctgcagggtt tgtggctgtt taggccccaa aacgtgctcc agatgccaca aagcatatta 
541 ctgcagcaag gagcatcaga ccctagactg gagattggga cataagcagg cttgtgcaca 
601 accagatcat ctggaccata taattccaga ccacaacttc ctttttccag aatttgaaat 
661 tgtaatagaa acagaagatg agattatgcc tgaggttgtg gaaaaggaag attactcaga 
721 gattataggg agcatgggtg aagcacttga ggaagaactg gattccatgg caaaacatga 
7 81 atccagggaa gataaaattt ttcagaagtt taaaactcag atagcccttg aaccagaaca 
841 gattcttaga tatggcagag gtattgcccc catctggatt tctggtgaaa atattcctca 
901 agaaaaggat attccagatt gcccctgtgg tgccaagaga atattggaat tccaggtcat 
961 . gcctcagctc ctaaactacc tgaaggctga cagactgggc aagagcattg actggggcat 
1021 cctggctgtc ttcacctgtg ctgagagctg cagcttgggt actggctata cagaagaatt 
1081 tgtgtggaag caggatgtaa cagatacacc gtaaaggcat cttaaagcct tgaaaaatgt 
1141 taataatctt ttataccttg caattccatt tctgggattt tatcctaagg aaatacttat 
1201 accaaaaata gaggtgcaga gatgttgaca gattgcttac acagtgtcta cttattagtg 
1261 aaacaaaagt gtccagtgac agggaattaa ataaattttg gtacatccac a 



PDCD2 cDNA sequence 2 (SEP ID NO: 8') 

NM_1 44781 Homo sapiens programmed cell death 2 (PDCD2), transcript variant 2, mRNA 

1 tcttgccttc cggcccggcg cccgatttcc gccttccgac ccagctgtgg gctgcgcccc 
61 acgccagccc gcgccccgca tggctgccgc cggggccagg cctgtggagc tgggcttcgc 
121 cgagtcggcg ccggcgtggc gactgcgcag cgagcagttc cccagcaagg tgggcgggcg 
181 gccggcatgg ctgggcgcgg ccgggctgcc ggggccccag gccctggcct gcgagctgtg 
241 cggccgcccg ctctccttcc tgctgcaggt gtatgcgccg ctgcctggcc gcccggacgc 
301 cttccaccgc tgcatcttcc tcttctgctg ccgcgagcag ccgtgctgtg ccggcctgcg 
361 agtttttagg aatcaactac ccaggaaaaa cgatttttac tcatatgagc caccttctga 
421 gaatcctccc ccagaaacag gagaatcagt gtgtctccag cttaagtctg gtgctcatct 
481 ctgcagggtt tgtggctgtt taggccccaa aacgtgctcc agatgccaca aagcatatta 
541 ctgcagcaag gagcatcaga ccctagactg gagattggga cataagcagg cttgtgcaca 
601 accagatcat ctggaccata taattccaga ccacaacttc ctttttccag aatttgaaat 
661 tgtaatagaa acagaagatg agattatgcc tgaggttgtg gaaaaggaag attactcaga 
721 gattataggg agcatgggta agcagtttca ggacttcatt cattaagtgg ttaaacataa 
781 tacttggaag aaagggctcc atgtgcctag aagagaggta ctgagaggaa gactcacttt 
841 ggaggctgta gcatacaatt ttcagatatt gcctcaggta aaaatatact tcctggactt 
901 tgttttctga cacataagag gtgtgttctg ctccctgtaa agacaagggt gggtatccag 
961 atggtcccat gagtagggct gcacaagatg ctggaggctt ggtaagttcc tctgggtcgc 
1021 agatcggttt ctcgggtcgg gatagtgtga gtgcctagca cagtgtcggg cacgcagaag 
1081 ggccccttaa aagtttctct ttcatctggc cagttttaga tacacaattt tgtcagttta 
1141 cttacagtgc atactcttgg gtagtacttg tgctgaccaa gtatcttaga ggcttatttt 
1201 attatagtag ccaacattta tccagcactt accttatata aagggctgtt tgtgcatgag 
1261 ctcattaaaa tcgtgacagc agaccaatga gtgagaaact gccccatttt gaaggtgagg 
1321 aaattgaggt tctgggtata actttctttg gtcacataat attaaatttt acaatttgag 
1381 ccttgagcca tacacaaaac caccacaaaa ttagatttat agactcaaaa tgaaaacatc 
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1441 agcttactgg tttgtagttc ataccagtca 

1501 ctgtgcctga ccttgtgctt gataacaggg 

1561 gatgctcaca gtcttatgga ggagcccaaa 

1621 tgactgataa gagtacaata caggtgccat 

1681 ggaaggggcc gcgcgtgtgt tcatgcctat 

1741 tatctttagg agaaaggaga gcctagagta 

1801 atatgccttg ttagtgtaaa tgtgactgtg 

1851 taaagtaagt tttaaaaagc agtccctaat 

1921 atataaccac actaagaact cttccaggtg 

1981 tagaatatgc cctgagaatg aaaagaatgt 

2041 attactaaaa aaaaaaaaaa aaaaaa 



tacattccaa aacatgtttt gagtcttact 
atataatggg aagcaacact ccagtggtca 
taatatctgg ggaagttaaa gtccatataa 
gggaacacgt gacatcactg aagactgcct 
acgataaaca tgatacataa tgaaaatgct 
gcaggatcaa ggatgaaagc tggacttcaa 
gaactgtatg agtattttaa gattatggag 
catcaaaagt aaaaaactct tgatgtagtc 
acttcaaaac ataggacagt acatctctag 
aacagtgtta gtattttgaa taaacatgtt 



ELP3 cDNA sequence ( SE Q ID NO: 9) 

NM_0 18091 Homo sapiens elongation protein 3 homolog (S. cerevisiae) (ELP3), mRNA 

1 gcagaaatga ggcagaagcg gaaaggagat ctcagccctg ctgagctgat gatgctgact 
61 ataggagatg ttattaaaca actgattgaa gcccacgagc aggggaaaga catcgatcta 
121 aataaggtga aaaccaagac agctgccaaa tatggccttt ctgcccagcc ccgcctggtg 
181 gatatcattg ctgccgtccc tcctcagtat cgcaaggtct tgatgcccaa gttaaaggcg 
241 aaacccatca gaactgctag tgggattgct gtcgtggctg tgatgtgcaa accccacaga 
301 tgtccacaca tcagttttac aggaaatata tgtgtatact gccctggtgg acctgattct 
361 gattttgagt attccaccca gtcttacact ggctatgagc caacctccat gagagctatc 
421 cgtgccagat atgacccttt cctacagaca agacaccgaa tagaacagtt aaaacaactt 
481 ggtcatagtg tggataaagt ggagtttatt gtgatgggtg gaacgtttat ggcccttcca 
541 gaagaataca gagattattt tattcgaaat ttacatgaUg ccttaUcagg acatacttcc 
601 aacaatattt acgaggcagt caagtattct gagagaagcc tcacaaagtg tattggaatt 
661 actattgaaa ccagaccaga ttactgcatg aagcgacatt taagtgacat gttgacctat 
721 ggctgcacaa ggctggagat tggggtgcag agtgtttatg aagatgtggc tagagacacc 
781 aacaggggcc acactgtgaa ggcagtgtgt gagtcatttc acctggccaa agattccggt 
841 tttaaagtgg tggcccatat gatgcctgac ctgccaaacg tgggactaga aagagacatt 
901 gaacagttca cagagttttt tgagaaccct gcttttcgtc ccgatgggct gaaactctat 
961 cctaccctgg tgattcgtgg gaccgggctt tatgagcttt ggaaatcagg aagatataag 
1021 agttactctc ctagtgacct ggttgaattg gtggctcgga tcctagccct cgtgcctcca 
1081 tggactcgag tgtaccgagt acagagggat attccaatgc ctttagttag ctcaggagta 
1141 gagcatggta acctgagaga gctggcactt gcaagaatga aagacctcgg aatacagtgt 
1201 cgagatgtga gaaccagaga agttggaatc caagaaattc atcacaaagt acggccatac 
1261 caggttgaat tggtaaggag agattatgtt gcaaatggtg gctgggaaac attcttgtca 
1321 tacgaagacc cagatcaaga cattttgatt ggcctcctac gattacgcaa gtgttcagaa 
1381 gaaactttcc gtttcgaatt gggtggaggt gtctccatag tacgagagct gcatgtgtat 
1441 gggagtgtgg tccctgtgag cagccgggat cctactaaat ttcagcatca gggatttggc 
1501 atgctgctga tggaggaagc agaaagaata gctagagaag aacatgggtc tgggaaaatc 
1561 gctgtgatat caggggtcgg caccaggaat tattatagaa agatcggcta cagattacaa 
1621 ggcccgtaca tggtgaagat gctgaaataa tggccacacc agtccactct tctgcagtat 
1681 cctccctggc agaacacgga gaatcaggat ttcttaaata ctcaacagag aggctgagca 
1741 gagcaaatgg ggggcttcac cctcatcccg cagctgcaga gactggaaac tgccttcaag 
1801 gccacggctg gtcatctgct gaccacaccc cagatccgcc ctctcctgcg tgcaccccaa 
1861 aaaatcactt gcgtttttga ggcttaaatc atctatccag tttctacatt ttgcatgagg 
1921 cctgcaggtg gcctattttg actcagacgg tgaaaaaagc aaattaactc atttggacac 
1981 cataactcat gcaataaaac tgattgtcat tcgaggagca aacttaagag tagtttattt 
2041 atataccctg gggacagaaa gtcaggttga aacaggaaaa ccaccagact ctaatctcag 
2101 ccctttaacg acatacgcat tggagcgcaa gttaggaaaa tgagcttttg ttttcatgga 
2161 aatcattctg attacagtgc tgatgtttag aaataaatag cagtgtgact gggaaagagg 
2221 aattLgcagtt gtgggggtgt gagcctggca gcagccagcc agcagcctct cccaggcggg 
2281 agtctaccat ccgagacggc gatgacaaag agcttcattc cacattcttt gttatctcta 
2341 cttcccaccc tcttggcaac tacagagcag tgtgggcagc cccaagtgtg gtccccagag 
2401 agcgtttggc tttcctgtct gtctatcctg agcgggtgga gtctcaggtt gtgtgcccct 
24 61 aaatcaagat ttgcttccac agaagccatt acttgcaatt tttttttttt tttctgagaa 
2521 agtctcgctg tgtcacccag gctggagtgc agtggcgcaa tctcactgca tcctccgcct 
2581 cccgggttca agcgattctc ccgcctcagc ctcctgagta gctgggatta caggcacccg 
2641 ccgctgctaa tttttgtatt tttagtagag atgggggttt caccatattg gtcaggctgg 
2701 tctcgaactc ctgacctcag gtgatcaacc caccttggcc tccctaaatg ccgggattac 
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27 61 aggcatgagc caccgctccc agcctttgat 
2821 ggagaaaggt aagagttcaa gttcaacccg 
2881 ttggcttctt caaaggctcc tcttgtagaa 
2941 tttggagact ctgcctggag agggtcagtt 
3001 tcctgtggca ttttacaaag gtttaaagga 
3061 aaagtcttta acatataaaa aaaaaaaaaa 



tttttaaggt ggattttggt tgttataaat 
tgtgtgaaag caaaacaatg gaaaacagga 
ctgcctcttt gaaatttcga ggtaatctac 
cctaagttaa aagcatcgct taaccttggc 
attgattcct ctgaaagggc ctgaaaataa 



CHDCl cDNA sequence fSEO ID NO: lO") 



NM OISI 16 Homo sapiens calponin homology (CH) domain containing 1 (CHDCl), mRNA 





ccgcagtcct 


age ^^"^^ 


gggacaggaa 


accttcaaga 


ccgagctgcc 


acggccgcct 


61 


ccccgcccgc 


ccccca tot 


acgcgcctgc 


ccacaccctc 


ctcccctcct 


tccagcgcct 


91 


ttcggtggag 


cactgcggca 


ctcagcccga 


gctgccgttt 


tcccctcgcg 


gggaacgctg 




tgaccccccc 


gcaggagcgg 


cggggcgggg 


tgggggggcc 


cgggagaaga 


tggcgacgcc 




gggaagcgaa 




tcgtcccggc 


cctttcggta 


gctactctgc 


acccacttca 




tcatccccac 


caccaccacc 


accaccatca 


gcaccacgga 


ggaaccggcg 


cccccggcgg 




ggcgggtggt 


ggcggcggtg 


gcagcggggg 


cttcaacctg 


cccttgaacc 


ggggtctgga 




gcgcgcgctt 


gaggaggcgg 


ccaact:ccgg 


ggggctgaac 


ctgagcgcca 


ggaaat:tgaa 




ggaa ccc 


cgtaccgcag 


cccccgggca 


tgacctctcg 


gacacggtgc 


aggcagactt 






agac gg g 


aagttccaat 


ggaattgtgc 


cattttgtat 


cactggaaat 


601 




a cacaac 


gtatcagagt 


cattcctgag 


gccatcgtta 


atctgcagat 


661 


gc gac ac 


c gaac ga 


g cgaaa ca 


gctgtccgcc 


ctgcctgcct 


gcctgtgtgg 


721 


c gcc c c 


aaag 


cgcaag aa 




ggatcattac 


cagaagagat 


781 


agg cage c 


aaacag aa 


ggagc gga 


tgtcagctgc 


aacgagatca 








ggtcag ga 


aa ctctacg 


agaactgaat 


gtcagaagaa 


attaccttaa 




^ ^^^^ 


caagaactag 


tagatcttcc 


cttggtaaag 


tttgactttt 


cctgcaacaa 


961 


?^ c^^t 


attccaattt 


gttttagaga 




ctgcaagtgt 


tactacttga 


1021 


gaa, aaccc 


c g^ag c c 


c ccagcaca 


^^^^^'^'^^^ 
ga gcaca 


aagggcaaag 




1081 


-aag a c g 


agca acaag 


ca gccaga 


aagacagc 


gac ccc 




1141 


ca ggagagg 


ataat 


accagcacg 


ggaaga ggc 


aagaagga 


c ga cggg 


1201 


t^tta^cct 




a aagcga 


a c gccacc 


gagcc c g 


acgaagacac 


1261 






tt ^ 


ca ggaagaa 


gaacaga ca 


caaggagga 


1321 


ctcgtgccat 


tt 


ccg aaagg 


ggaa ca 


^^^^ 




1381 


ttcccttttg 


t 








c ga agagc 


1441 




cattcggaat 


ttatgaacta 


taaggcaagg 


gcagaagact 


gtgaagagct 


1501 


gttacggata 


gaagaggatg 


tgcactggca 


aactgagggc 


ataataagtt 


catccaaaga 


1561 


tcaggacatg 


gatatagcaa 


tgatcgagca 


gctgagagaa 


gcagtagatt 


tgctgcaaga 


1621 


tcccaatgga 


ttaagcacag 


atattacaga 


gagaagtgtt 


ttaaacctat 


atcctatggg 


1681 


atcagcagaa 


gccttagaat 




tgcactgaat 


ggtcaaatac 


agctggagac 


1741 


atctccggtg 


tgtgaggtgc 


aaagtgatct 




agtaacggga 


gccagtattc 


1801 


tccaaatgag 


attagagaga 


actcccctgc 






gcacagctcc 


1861 


atttggcctg 


aagcctcgat 




aagacctcag 


agaaatttgg 


aatctataga 


1921 


cccgcagttt 


acaatccgga 


ggaaaatgga 


gcagatgaga 


gaagagaaag 


agctggtgga 


1981 


acaacttcgt 


gagagcattg 


agatgagatt 


gaaggtcagt 


ctacacgaag 


acctgggggc 


2041 


agccctcatg 


gatggtgtcg 


tcctctgcca 


tctggtcaac 


cacatccgcc 


cacggtcggt 


2101 


tgcaagcatc 


catgtcccat 


caccagcggt 




agcatggcca 


aatgcagaag 


2161 


aaatgtggaa 


aactttttgg 


aagcgtgccg 


aaaattagga 


gtaccagagg 


ctgacctctg 


2221 


ctctccgtgt 


gacatcctgc 


agttggattt 


tcgtcacatt 


cgaaagactg 


ttgacactct 


2281 


gctggcactc 


ggggagaaag 


ccccaccacc 


aacttctgcc 


ctccgctcca 


gggaccttat 


2341 


aggcttctgt 


cttgtccata 


ttctctttat 


agtgctggtc 


tatatcactt 


accactggaa 


2401 


tgctctgtcc 


gcataacgtc 


tgcacgtgca 


tccaaacgct 


gtgctctgtc 


gccctcaacc 


2461 


tttgcagggt 


ccttcctacc 


tttgagcctt 


tgccttgcaa 


acttccatcc 


ctgtcatgtc 


2521 


ttcagttatc 


tctcgagttt 


tgaagctgaa 


cagtagcaaa 


tcagattttc 


cagaagcaca 


2581 


aactttgtag 


aatacagttt 


agtataattc 


ctctcactta 


ctgaaataca 


acgacgacga 


2641 


ctgcaaagtg 


tatgcacacc 


gcatgcttcc 


tcatccacat 


agtgccagca 


gcagtgccac 


2701 


gcagttcctc 


ctctccctcc 


cggtgagctg 


ctgccctggg 


cagaggggag 


gagaattcca 


2761 


ggacaagagt 


gtcaaggaca 


gggatttagc 


atatggaagt 


ctttcctttg 


ggtcagtatt 


2821 


gaactagaat 


tctaattcgg 


gactgggcaa 


ttgagctgta 


taggggccac 


cttgcaggga 


2881 


ggacagaaaa 


ctaacatttt 


ggcccaactt 


gatctataca 


aaactttaat 


aataccacta 


2941 


ctgaccaagt 


tggacgtgta 


cacgtactca 


cactgccttg 


atggccattc 


gattggattc 
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3001 


ctcccaaatt 


tcctaaaaag 


qqaqccqcqa 


aqqqcqctgq 


qcaqtqtq c 


c ccaacttc 


3061 


caccccggca 


agcccctctg 


tcctatqcaq 


aaaqqcqctc 


caqqqaaqqa 


agtgtcgttg 


3121 


ctgttagagc 


ctcacgtgga 


qqaqtcactt 


aaacaccaqt 


tttttact c 


ttaattcctt 


3181 


gttaggtctt 


ctcttgaggc 


tcttagaaaa 


qcqttttcca 


qaqaqatttc 


tatttttgaa 


3241 


caatggaacg 


gatcactgct 


tttttgccac 


atcacataat 


aactqccq 




3301 


acgga'ttcga 


ctctattcat 




aaccatoaac 


cgtggaacat 


tctta t t 


3361 


ggtigctitggg 


ttatgatggc 


aaaaqtcaao 


aaaaaaatta 


ctttcattct 


t t 


3421 


agtttctcta 


attatttgaa 


atqttcattt 


aacctttaat 


tttcactgat 


attaa^t ^ 








aaaggattat 


atagtaacta 




tggagcaatg 


3541 


tgatcagttt 


gcatttaaaa 


ggaaaaaaaa 


gaattttatc 


ttagccagaa 


tgtccctgga 


3601 


ttcaggggtg 


tctttgtata 


atatgagagg 


gccttgttcc 


aaggtcaagg 




3661 


attttacatg 


ctgtttgcca 


aatcttgttt 


cttagcttgg 


ggagatgatg 


gacttagctt 


3721 


cctcaagata 


aatttctagt 


ttattaagat 


gcaaacagct 






3781 


agaaaatctt 


atttttctga 


acattttcat 


gaatccaggg 


gacttgaaaa 


tatggaagac 


3841 


ccacatagtt 


agaagaatat 




attccttgct 


gctaagtcag 


atcagatttg 


3901 


ctaacaggaa 




catgacagta 


tcttgagtta 


tgtgagtttt 


ttttcctcct 


3961 


gactttgtgt 


tgattggtga 


aatgcagggt 


atgtggaagt 


tatctaatta 


acctcagttg 


4021 


tatatgaata 


acccacagat 


gtactgaatt 


acttttggtg 


ctatcttgta 


ctcttcaatc 


4081 


tgtaacacaa 


taaaatccct 


ttgtacgatg 


tctaatgagc 


accctgagcc 


ataaattgct 


4141 




attttgggtg 


att 









ERG cDNA sequence 1 fSEO ID NO: m 

NM_182918 Homo sapiens v-ets erythroblastosis virus E26 oncogene like (avian), (ERG), transcript 
variant 1 , mRNA 



1 




gctctaaaca 




aactactttc 


tggtcagaga 


gaagcaataa 


61 


ttattattaa 


catttattaa 




acttgattgc 


attatggcca 


gcactattaa 


121 


ggaagcctta 


tcagttgtga 


gtgaggacca 


gtcgttgttt 


gagtgtgcct 


acggaacgcc 


181 


acacctggct 


aagacagaga 


tgaccgcgtc 


ctcctccagc 


gactatggac 


agacttccaa 


241 


gatgagccca 


cgcgtccctc 


agcaggattg 


gctgtctcaa 


cccccagcca 


gggtcaccat 


301 


caaaatggaa 


tgtaacccta 


gccaggtgaa 


tggctcaagg 


aactctcctg 


atgaatgcag 


361 


tgtggccaaa 


ggcgggaaga 


tggtgggcag 




gttgggatga 


actacggcag 


421 


ctacatggag 


gagaagcaca 


tgccaccccc 




acgaacgagc 


gcagagttat 


481 


cgtgccagca 


gatcctacgc 


tatggagtac 


agaccatgtg 


cggcagtggc 


tggagtgggc 


541 


ggtgaaagaa 


tatggccttc 


cagacgtcaa 






tcgatgggaa 


601 


ggaactgtgc 


aagatgacca 


aggacgactt 


ccagaggctc 


acccccagct 


acaatgccga 


661 


catccttctc 






agagactcct 


cttccacatt 




721 


tgatgttgat 




aaaactctcc 


acggttaatg 


catgctagaa 


acacaggggg 


781 


tgcagctttt 




atacttcagt 


atatcctgaa 


gctacgcaaa 


gaattacaac 


841 


taggccagat 


ttaccatatg 


agccccccag 


gagatcagcc 


tggaccggtc 


acggccaccc 


901 


cacgccccag 


tcgaaagctg 




tccttccaca 


gtgcccaaaa 


ctgaagacca 


961 


gcgtcctcag 


ttagatcctt 


atcagattct 


tggaccaaca 


agtagccgcc 


ttgcaaatcc 


1021 


aggcagtggc 


cagatccagc 


tttggcagtt 


cctcctggag 


ctcctgtcgg 


acagctccaa 


1081 


ctccagctgc 


atcacctggg 


aaggcaccaa 


cggggagttc 


aagatgacgg 


atcccgacga 


1141 


ggtggcccgg 


cgctggggag 


agcggaagag 


caaacccaac 


atgaactacg 


ataagctcag 


1201 


ccgcgccctc 


cgttactact 


atgacaagaa 


catcatgacc 


aaggtccatg 


ggaagcgcta 


1261 


cgcctacaag 


ttcgacttcc 


acgggatcgc 


ccaggccctc 


cagccccacc 


ccccggagtc 


1321 


atctctgtac 


aagtacccct 


cagacctccc 


gtacatgggc 




cccacccaca 


1381 


gaagatgaac 


tttgtggcgc 


cccaccctcc 


agccctcccc 


gtgacatctt 


ccagtttttt 


1441 


tgctgcccca 


aacccatact 


ggaattcacc 


aactgggggt 


atatacccca 


acactaggct 


1501 


ccccaccagc 




ctcatctggg 


cacttactac 




gcggaggctt 


1561 




cgtgcattca 


ccagcccatc 


gccacaaact 


ctatcggaga 


acatgaatca 


1621 


aaagtgcctc 


aagaggaatg 


aaaaaagctt 


tactggggct 


ggggaaggaa 


gccggggaag 


1681 


agatccaaag 


actcttggga 


gggagttact 


gaagtcttac 


tacagaaatg 


aggaggatgc 


1741 


taaaaatgtc 


acgaatatgg 


acatatcatc 


tgtggactga 


ccttgtaaaa 


gacagtgtat 


1801 


gtagaagcat 


gaagtcttaa 


ggacaaagtg 


ccaaagaaag 


tggtcttaag 


aaatgtataa 


1861 


actttagagt 


agagtttgga 


atcccactaa 


tgcaaactgg 


gatgaaacta 


aagcaataga 


1921 


aacaacacag 


ttttgaccta 


acataccgtt 


tataatgcca 


ttttaaggaa 


aactacctgt 


1981 


atttaaaaat 


agaaacatat 


caaaaaaaaa 
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ERG cDNA sequence 2 (SEP ID NO: 121 

NM_004449 Homo sapiens v-ets erythroblastosis virus E26 oncogene like (avian), (ERG), transcript 



variant 2, mRNA 












1 


atgattcaga 


ctgtcccgga 


cccagcagct 


catatcaagg 


aagccttatc 


agttgtgagt 


61 


gaggaccagt 


cgttgtttga 


gtgtgcctac 


ggaacgccac 


acctggctaa 


gacagagatg 


121 


accgcgtcct 


cctccagcga 


ctatggacag 


acttccaaga 


tgagcccacg 


cgtccctcag 


181 


caggattggc 


tgtctcaacc 


cccagccagg 


gtcaccatca 






241 


caggtgaatg 


gctcaaggaa 


ctctcctgat 


gaatgcagtg 


tggccaaagg 


cgggaagatg 


301 


gtgggcagcc 


cagacaccgt 


tgggatgaac 


tacggcagct 


acatggagga 


gaagcacatg 


361 


ccacccccaa 


acatgaccac 


gaacgagcgc 


agagttatcg 


tgccagcaga 




421 


tggagtacag 


accatgtgcg 


gcagtggctg 


gagtgggcgg 


tgaaagaata 


tggccttcca 


481 


gacgtcaaca 


tcttgttatt 


ccagaacatc 


gatgggaagg 


aactgtgcaa 


gatgaccaag 


541 


gacgacttcc 


agaggctcac 


ccccagctac 


aacgccgaca 


tccttctctc 


acatctccac 


601 


tacctcagag 


agaotcctct 


tccacatttg 


acttcagatg 


atgttgataa 


agccttacaa 


661 


aactctccac 


ggttaatgca 


tgctagaaac 


acagatttac 


catatgagcc 


ccccaggaga 


721 


tcagcctgga 


ccggtcacgg 


ccaccccacg 


ccccagtcga 


aagctgctca 


accatctcct 


781 


tccacagtgc 


ccaaaactga 


agaccagcgt 


cctcagttag 


atccttatca 


gattcttgga 


841 


ccaacaagta 


gccgccttgc 


aaatccaggc 


agtggccaga 


tccagctttg 


gcagttcctc 


901 


ctggagctcc 


tgtcggacag 


ctccaactcc 


agctgcatca 


cctgggaagg 


caccaacggg 


961 


gagttcaaga 


tgacggatcc 


cgacgaggtg 


gcccggcgct 


ggggagagcg 


gaagagcaaa 


1021 


cccaacatga 


actacgataa 


gctcagccgc 


gccctccgtt 


actactatga 


caagaacatc 


1081 


atgaccaagg 


tccatgggaa 


gcgctacgcc 


tacaagttcg 


acttccacgg 


gatcgcccag 


1141 


gccctccagc 


cccacccccc 


ggagtcatct 


ctgtacaagt 


acccctcaga 




1201 


atgggctcct 


atcacgccca 


cccacagaag 


atgaactttg 


tggcgcccca 


ccctccagcc 


1261 


ctccccgtga 


catcttccag 


tttttttgct 


gccccaaacc 


catactggaa 


ttcaccaact 


1321 


gggggtatat 




taggctcccc 


accagccata 


tgccttctca 


tctgggcact 



1381 tactactaa 

[0270] Following are amino acid sequences for PSMBl (SEQ ID NO: 13), TBP (SEQ ID NO: 14), 
PDCD2 (SEQ ID NO: 15 and 16), ELP3 (SEQ ID NO: 17). CHDCl (SEQ ID NO: 18), and ERG (SEQ 
ID NO: 19 and 20). 

PSMBl amino acid sequence f SEP ID NO: 1 3-> 

NP_002784 Homo sapiens proteasome (prosome, macropain) subunit, beta type, 1 (PSMBl), protein 

MLSSTAMYSAPGRDLGMEPHRAAGPLQLRFSPYVFNGGTILAIAGEDFAIVASDTRLSE 
GFSIHTRDSPKCYKLTDKTVIGCSGFHGDCLTLTKIIEARLKMYKHSNNKAMTTGAIAA 
MLSTILYSRRFFPYYVYNIIGGLDEEGKGAVYSFDPVGSYQRDSFKAGGSASAMLQPLL 
DNQVGFKNMQNVEHVPLSLDRAMRLVKDVFISAAERDVYTGDALRICIVTKEGIREET 
VSLRKD 



TBP amino acid sequence fSEO ID NO: M") 

NP_003 1 85 Homo sapiens TATA box binding protein (TBP), protein 

MDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTNSLSILEEQQRQQQ 
QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQAVAAAAVQQSTSQQATQGT 
SGQAPQLFHSQTLTTAPLPGTTPLYPSPMTPMTPITPATPASESSGIVPQLQNIVSTVNLG 
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CKLDLKTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAAR 

KYARWQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYR 

MKPRIVLLIFVSGKWLTGAKVRAEIYEAFENIYPILKGFRKTT 



PDCD2 amino acid sequence 1 (SEP ID NO: 15) 

NP_002589 Homo sapiens programmed cell death 2 (PDCD2), isoform 1, protein 

MAAAGARPVELGFAESAPAWRLRSEQFPSKVGGRPAWLGAAGLPGPQALACELCGRP 

LSFLLQVYAPLPGRPDAFHRCIFLFCCREQPCCAGLRVFRNQLPRKNDFYSYEPPSENPPP 

ETGESVCLQLKSGAHLCRVCGCLGPKTCSRCHKAYYCSKEHQTLDWRLGHKQACAQP 

DHLDHIIPDHNFLFPEFEIVIETEDEIMPEWEKEDYSEIIGSMGEALEEELDSMAKHESRE 

DKIFQKFKTQIALEPEQILRYGRGIAPIWISGENIPQEKDIPDCPCGAKRILEFQVMPQLLN 

YLKADRLGKSIDWGILAVFTCAESCSLGTGYTEEFVWKQDVTDTP 



PDCD2 amino acid sequence 2 (SEP ID NO: 1 6'> 

NP_659005 Homo sapiens programmed cell death 2 (PDCD2), isoform 2, protein 

MAAAGARPVELGFAESAPAWRLRSEQFPSKVGGRPAWLGAAGLPGPQALACELCGRP 
LSFLLQVYAPLPGRPDAFHRCIFLFCCREQPCCAGLRVFRNQLPRKNDFYSYEPPSENPPP 
ETGESVCLQLKSGAHLCRVCGCLGPKTCSRCHKAYYCSKEHQTLDWRLGHKQACAQP 
DHLDHIIPDHNFLFPEFEIVIETEDEIMPEWEKEDYSEIIGSMGKQFQDFIH 



ELP3 amino acid sequenc e fSEQ ID NO: 17) 

NP_060561 Homo sapiens elongation protein 3 homolog (S. cerevisiae) (ELP3), protein 

MRQKRKGDLSPAELMMLTIGDVIKQLIEAHEQGKDIDLNKVKTKTAAKYGLSAQPRLV 

DIIAAVPPQYRKVLMPKLKAKPIRTASGIAVVAVMCKPHRCPHISFTGNICVYCPGGPDS 

DFEYSTQSYTGYEPTSMRAIRARYDPFLQTRHRIEQLKQLGHSVDKVEFIVMGGTFMAL 

PEEYRDYFIRNLHDALSGHTSNNIYEAVKYSERSLTKCIGITIETRPDYCMKRHLSDMLT 

YGCTRLEIGVQSVYEDVARDTNRGHTVKAVCESFHLAKDSGFKVVAHMMPDLPNVGL 

ERDIEQFTEFFENPAFRPDGLKLYPTLVIRGTGLYELWKSGRYKSYSPSDLVELVARILA 

LVPPWTRVYRVQRDIPMPLVSSGVEHGNLRELALARMKDLGIQCRDVRTREVGIQEIHH 

KVRPYQVELVRRDYVANGGWETFLSYEDPDQDILIGLLRLRKCSEETFRFELGGGVSIV 

RELHVYGSWPVSSRDPTKFQHQGFGMLLMEEAERIAREEHGSGKIAVISGVGTRNYYR 

KIGYRLQGPYMVKMLK 
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CHDCl amino acid sequence fSEO ID NO: 18) 

NP_055931 Homo sapiens calponin homology (CH) domain containing 1 (CHDCl), protein 

MATPGSEPQPFVPALSVATLHPLHHPHHHHHHHQHHGGTGAPGGAGGGGGGSGGFNL 

PLNRGLERALEEAANSGGLNLSARKLKEFPRTAAPGHDLSDTVQADLSKNRLVEVPME 

LCHFVSLEILNLYHNCIRVIPEAIVNLQMLTYLNLSRNQLSALPACLCGLPLKVLIASNNK 

LGSLPEEIGQLKQLMELDVSCNEITALPQQIGQLKSLRELNVRRNYLKVLPQELVDLPLV 

KFDFSCNKVLVIPICFREMKQLQVLLLENNPLQSPPAQICTKGKVHIFKYLSIQACQIKTA 

DSLYLHTMERPHLHQHVEDGKKDSDSGVGSDNGDKRLSATEPSDEDTVSLNVPMSNIM 

EEEQIIKEDSCHRLSPVKGEFHQEFQPEPSLLGDSTNSGEERDQFTDRADGLHSEFMNYK 

ARAEDCEELLRIEEDVHWQTEGIISSSKDQDMDIAMIEQLREAVDLLQDPNGLSTDITER 

SVLNLYPMGSAEALELQDSALNGQIQLETSPVCEVQSDLTLQSNGSQYSPNEIRENSPAV 

SPTTNSTAPFGLKPRSVFLRPQRNLESIDPQFTIRRKMEQMREEKELVEQLRESIEMRLKV 

SLHEDLGAALMDGWLCHLVNHIRPRSVASfflVPSPAVPKLSMAKCRRNVENFLEACR 

KLGVPEADLCSPCDILQLDFRHIRKTVDTLLALGEKAPPPTSALRSRDLIGFCLVHILFIVL 

VYITYHWNALSA 



ERG amino acid sequence 1 (SEQ ID NO: 19) 

NP_891548 Homo sapiens v-ets erythroblastosis virus E26 oncogene like (avian), (ERG), isoform 1, 
protein 

MASTIKEALSWSEDQSLFECAYGTPHLAKTEMTASSSSDYGQTSKMSPRVPQQDWLS 

QPPARWIKMECNPSQWGSRNSPDECSVAKGGKMVGSPDTVGMNYGSYMEEKHMPP 

PNMTTNERRVIVPADPTLWSTDHVRQWLEWAVKEYGLPDVNILLFQNIDGKELCKMT 

KDDFQRLTPSYNADILLSHLHYLRETPLPHLTSDDVDKALQNSPRLMHARNTGGAAFIF 

PNTSVYPEATQRITTRPDLPYEPPRRSAWTGHGHPTPQSKAAQPSPSTVPKTEDQRPQLD 

PYQILGPTSSRLANPGSGQIQLWQFLLELLSDSSNSSCITWEGTNGEFKMTDPDEVARRW 

GERKSKPNMNYDKLSRALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPPESSLY 

KYPSDLPYMGSYHAHPQKMNFVAPHPPALPVTSSSFFAAPNPYWNSPTGGIYPNTRLPT 

SHMPSHLGTYY 
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ERG amino acid sequence 2 (SEP ID NO: 20") 

NP_004440 Homo sapiens v-ets erythroblastosis virus E26 oncogene like (avian), (ERG), isoform 2, 

protein 

MIQTVPDPAAHIKEALSWSEDQSLFECAYGTPHLAKTEMTASSSSDYGQTSKMSPRVP 

QQDWLSQPPARVTIKMECNPSQVNGSRNSPDECSVAKGGJCMVGSPDTVGMNYGSYME 

EKHMPPPNMTTNERRVIVPADPTLWSTDHVRQWLEWAVKEYGLPDVNILLFQNIDGKE 

LCKMTKDDFQRLTPSYNADILLSHLHYLRETPLPHLTSDDVDKALQNSPRLMHARNTD 

LPYEPPRRSAWTGHGHPTPQSKAAQPSPSTVPKTEDQRPQLDPYQILGPTSSRLANPGSG 

QIQLWQFLLELLSDSSNSSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSR 

ALRYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPPESSLYKYPSDLPYMGSYHAHP 

QKMNFVAPHPPALPVTSSSFFAAPNPYWNSPTGGIYPNTRLPTSHMPSHLGTYY 



[0271] Modifications may be made to the foregoing without departing from the basic aspects of the 
invention. Although the invention has been described in substantial detail with reference to one or more 
specific embodiments, those of skill in the art will recognize that changes may be made to the 
embodiments specifically disclosed in this application, yet these modifications and improvements are 
within the scope and spirit of the invention, as set forth in the aspects which follow. All publications or 
patent documents cited in this specification are incorporated herein by reference as if each such 
publication or document was specifically and individually indicated to be incorporated herein by 
reference. 

[0272] Citation of the above publications or documents is not intended as an admission that any of 
the foregoing is pertinent prior art, nor does it constitute any admission as to the contents or date of these 
publications or documents. U.S. patents and other publications referenced herein are hereby incorporated 
by reference. 



230 



PATENT 
Docket 524593008700 



What is claimed is : 

1 . A method for identifying a subject at risk of osteoarthritis, which comprises detecting the 
presence or absence of one or more polymorphic variations associated with osteoarthritis in a nucleic acid 
sample from a subject, wherein the one or more polymorphic variations are detected in a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-12; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-12; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-12; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); 

whereby the presence of the polymorphic variation is indicative of the subject being at risk of 
osteoarthritis. 

2. The method of claim 1 , which further comprises obtaining the nucleic acid sample from the 

subject. 

3 . The method of claim 1 , wherein the one or more polymorphic variations are detected within 
a region spanning chromosome positions 170719500 to 170766500 in human genomic DNA. 

- 4. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 1 selected from the group consisting of 229, 6310, 1 1840, 1 1870, 
12064, 13392, 16354, 16559, 16935, 17616, 17737, 18321, 18453, 18811, 20020, 21662, 23197, 23446, 
24339, 25504. 27174, 28008, 29294, 29759, 30832, 44512, 44850, 45884, 46345, 48589, 53371, 5391 1, 
53990, 55152, 55667, 58952, 59315, 60029, 61477, 62988, 63090, 64021, 65685, 70220, 70323, 70959, 
73436, 82945, 82958, 82961, 82964, 82965, 83006, 83025, 83034, 83074 ,83132, 83155, 83172, 83174, 
83206, 83216, 83234, 83252, 83260, 83263, 83296, 83319, 83322, 83324, 83357, 83375, 83381, 83389, 
83443, 83499, 83545, 83566, 83591, 83619, 83698, 83780, 83784, 83826, 83832, 83852, 86297, 86315, 
86420, 86460, 86714, 86718, 86736, 86753, 86766, 88162, 88218, 88246, 88255, 88309, 88310, 88471, 
88619, 88904, 89044, 90531, 90534, 90613 and 46252. 
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5. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 1 selected from the group consisting of 229, 6310, 16559, 18453, 
25504, 27174, 30832, 44850, 45884, 48589, 61477, 82961 and 46252. 

6. The method of claim I, wherein the one or more polymorphic variations are detected within 
a region spanning chromosome positions 27963000 to 27983000 in human genomic DNA. 

7. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 2 selected from the group consisting of 21 1, 473, 1536, 5639, 17186, 
17335, 25029, 25111, 28811, 28863, 30809, 40985, 45147, 45282, 46168, 46328, 49077, 51925, 52141, 
52168, 60852, 62468, 65572, 79089, 79541, 79790, 90843, 90978, 91052, 91131, 91132, 94439 and 
94621. 

8. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 2 selected from the group consisting of 40985, 46168, 51925 and 
52168. 

9. The method of claim 1 , wherein the one or more polymorphic variations are detected within 
a region spanning chromosome positions 44962000 to 45013000 in human genomic DNA. 

10. The method of claim 1, wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 3 selected from the group consisting of 243, 10208, 15049, 15111, 
15272, 15287, 15326, 15327, 17038, 19391, 21702, 22431, 22881, 27744, 32564, 32698, 33104, 33181, 
33256, 33543, 35567, 40085, 40482, 45641, 46059, 48504, 48919, 49693, 49874, 50020, 50616, 50719, 
5551 1, 65533, 70529, 75591, 77266, 80368, 82475, 92462, 92480, 95819 and 96275. 

1 1 . The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 3 selected from the group consisting of 151 1 1, 45641 , 46059, 49693, 
49874, 50020, 50719, 70529, 82475, 92462, 92480 and 96275. 

12. The method of claim 1, wherein the one or more polymorphic variations are detected within 
a region spanning chromosome positions 38830000 to 38844000 in human genomic DNA. 

13. The method of claim 1, wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 4 selected from the group consisting of 231, 882, 960, 1 194, 1530, 
1673, 2096, 2285, 5873, 7256, 7988. 8222, 8381, 8814, 8915, 9642, 9902, 10619, 10927, 11032, 14377, 
15608, 15928, 16296, 17598, 19272. 20084, 20577, 28051, 29466, 29530, 29987, 30012, 30322, 32216, 
32516, 32544, 32746, 33137, 33538, 33798, 33802, 33964, 34132, 34210, 34317, 34499, 34753, 34845, 
35335, 36423, 36450, 36481, 38447, 38784, 39387, 39458, 39822, 40305, 40869, 40926, 41010, 41 134, 
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41984, 


42172, 


42753, 


43011, 


43176, 43320, 


43381, 44142, 


44383, 


44726, 


45087, 


45141, 45359, 45421, 


45456, 


45467, 


45486, 


45709, 


45716, 47626, 


49413,49796, 


49962, 


50075, 


50093, 


50571, 50615, 50780, 


50851, 


51459, 


53193, 


53702, 


53736, 53795, 


54109, 54126, 


54230, 


54894, 


55455, 


55499, 56522, 56662, 


56954, 


57267, 


58282, 


58916, 


59544, 59666, 


59913, 66846, 


67245, 


67652, 


67955, 


67966, 68420, 70226, 


70810, 


72246, 


73330, 


73457, 


74389, 74638, 


74640, 75358, 


75952, 


76098, 


77836, 


78449, 78507, 80031, 


81695, 


82775, 


82795, 


84611, 


84657, 84693, 


85020, 85048, 


85100, 


85325, 


85452, 


85868, 85936, 85990, 


86139, 


86497, 


87236, 


87248, 


87533, 87912, 


88108, 88494, 89598, 90235, 91287, 


91359, 92384, 92410, 


92900, 


94495, 


94512, 


97777 and 98333. 













14. The method of claim 1, wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 4 selected from the group consisting of 1673, 20577, 33137, 39822, 
45716, 49962, 51459, 54894, 55455, 55499, 58282, 68420 and 80031. 

1 5. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in linkage disequilibrium with one or more positions in claim 4, 7, 10 or 13. 

16. The method of claim 1, wherein detecting the presence or absence of the one or more 
polymorphic variations comprises: 

hybridizing an oligonucleotide to the nucleic acid sample, wherein the oligonucleotide is 
complementary to a nucleotide sequence in the nucleic acid and hybridizes to a region adjacent to the 
polymorphic variation; 

extending the oligonucleotide in the presence of one or more nucleotides, yielding extension 
products; and 

detecting the presence or absence of a polymorphic variation in the extension products. 

17. The method of claim 1, wherein the subject is a human. 

1 8. The method of claim 17, wherein the subject is a human female. 

1 9. The method of claim 1 7, wherein the subject is a human male. 

20. A method for identifying a polymorphic variation associated with osteoarthritis proximal to 
an incident polymorphic variation associated with osteoarthritis, which comprises: 

identifying a polymoqjhic variation proximal to the incident polymorphic variation associated 
with osteoarthritis, wherein the polymorphic variation is detected in a nucleotide sequence selected from 
the group consisting of: 
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(a) a nucleotide sequence in SEQ ED NO: 1-12; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-12; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-12; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 

variation; 

determining the presence or absence of an association of the proximal polymorphic variant with 
osteoarthritis. 

21 . The method of claim 20, wherein the incident polymorphic variation is at one or more 
positions in claim 4, 7, 10 or 13. 

22. The method of claim 20, wherein the proximal polymorphic variation is within a region 
between about 5 kb 5' of the incident polymorphic variation and about 5 kb 3' of the incident 
polymorphic variation. 

23. The method of claim 20, which further comprises determining whether the proximal 
polymorphic variation is in linkage disequilibrium with the incident polymorphic variation. 

24. The method of claim 20, which fvirther comprises identifying a second polymorphic variation 
proximal to the identified proximal polymorphic variation associated with osteoarthritis and determining 
if the second proximal polymorphic variation is associated with osteoarthritis. 

25. The method of claim 24, wherein the second proximal polymorphic variant is within a region 
between about 5 kb 5' of the incident polymorphic variation and about 5 kb 3' of the proximal 
polymorphic variation associated with osteoarthritis. 

26. An isolated nucleic acid comprising a nucleotide sequence selected from the group consisting 

of: 

(a) a nucleotide sequence in SEQ ID NO: 1-12; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-12; 
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(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-12; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

(e) a nucleotide sequence complementary to the nucleotide sequences of (a), (b), (c), or (d); 

wherein the nucleotide sequence comprises a polymorphic variation associated with osteoeirthritis 
selected from the group consisting of in SEQ ID NO: 1 a thymine at position 229, a guanine at position 
6310, a thymine at position 16559, an adenine at position 18453, an adenine at position 25504, an 
adenine at position 27174, an adenine at position 30832, a guanine at position 44850, an adenine at 
position 45884, an adenine at position 48589, a cytosine at position 61477, a cytosine at position 82961 
and a thymine at position 46252; in SEQ ID NO: 2 a cytosine at position 40985, a guanine at position 
46168, a thymine at position 51925 and a cytosine at position 52168; in SEQ ID NO: 3 a guanine at 
position 151 1 1, a thymine at position 45641, an adenine at position 46059, a cytosine at position 49693, 
an adenine at position 49874, an adenine at position 50020, a guanine at position 50719, an adenine at 
position 70529, an adenine at position 82475, a thymine at position 92462, a thymine at position 92480 
and a cytosine at position 96275; and in SEQ ID NO: 4 a guanine at position 1673, a thymine at position 
20577, a guanine at position 33137, a guanine at position 39822, an adenine at position 45716, a guanine 
at position 49962, an adenine at position 51459, a cytosine at position 54894, an adenine at position 
55455, an adenine at position 55499, a guanine at position 58282, an adenine at position 68420 and a 
thymine at position 8003 1 . 

27. An oligonucleotide comprising a nucleotide sequence complementary to a portion of the 
nucleotide sequence of (a), (b), (c), or (d) in claim 26, wherein the 3' end of the oligonucleotide is 
adjacent to a polymorphic variation associated with osteoarthritis. 

28. A microarray comprising an isolated nucleic acid of claim 26 linked to a soHd support. 

29. An isolated polypeptide encoded by the isolated nucleic acid sequence of claim 26. 

30. A method for identifying a candidate therapeutic for treating osteoarthritis, which comprises: 

(a) introducing a test molecule to a system which comprises a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of: 

(i) a nucleotide sequence in SEQ ID NO : 1 - 1 2; 
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(ii) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide 
sequence in SEQ ID NO: 1-12; 

(iii) a nucleotide sequence which encodes a polypeptide that is 90% or more identical 
to the amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-12; 

(iv) a fragment of a nucleotide sequence of (a), (b), or (c); or 

introducing a test molecule to a system which comprises a protein encoded by a nucleotide 
sequence of (i), (ii), (iii), or (iv); and 

(b) determining the presence or absence of an interaction between the test molecule and the 
nucleic acid or protein, 

whereby the presence of an interaction between the test molecule and the nucleic acid or protein 
identifies the test molecule as a candidate therapeutic for treating osteoarthritis. 

31 . The method of claim 30, wherein the system is an animal. 

32. The method of claim 30, wherein the system is a cell. 

33. The method of claim 30, wherein the nucleotide sequence comprises one or more 
polymorphic variations associated with osteoarthritis. 

34. The method of claim 33, wherein the one or more polymorphic variations associated with 
osteoarthritis are at one or more positions in claim 4, 7, 10 or 13. 

35. A method for treating osteoarthritis in a subject, which comprises contacting one or more 
cells of a subject in need thereof with a nucleic acid, wherein the nucleic acid comprises a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-12; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-12; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-12; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); and 

(e) a nucleotide sequence complementary to the nucleotide sequences of (a), (b), (c), or (d); 
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whereby contacting the one or more cells of the subject with the nucleic acid treats the 
osteoarthritis in the subject. 

36. The method of claim 35, wherein the nucleic acid is RNA or PNA. 

37. The method of claim 36, wherein the nucleic acid is duplex RNA. 

38. A method for treating osteoEirthritis in a subject, which comprises contacting one or more 
cells of a subject in need thereof with a protein, wherein the protein is encoded by a nucleotide sequence 
which comprises a polynucleotide sequence selected fix)m the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-12; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-12; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-12; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); 

whereby contacting the one or more cells of the subject with the protein treats the osteoarthritis 
in the subject. 

39. A method for treating osteoarthritis in a subject, which conqjrises: 

detecting the presence or absence of one or more polymorphic variations associated with 
osteoarthritis in a nucleic acid sample fi^om a subject, wherein the one or more polymorphic variation are 
detected in a nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-12; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-12; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-12; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 
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administering an osteoarthritis treatment to a subject in need thereof based upon the presence or 
absence of the one or more polymorphic variations in the nucleic acid sample. 

40. The method of claim 39, wherein the one or more polymorphic vetriations are detected at one 
or more positions in claim 4, 7, 10 or 1 3. 

41 . The method of claim 39, wherein the treatment is selected from the group consisting of 
administering a corticosteroid, a nonsteroidal anti-inflammatory drug (NSAID), a cyclooxygenase-2 
(COX-2) inhibitor, an antibody, a glucocorticoid, hyaluronic acid, chondrotin sulfate, glucosamine or 
acetaminophen; prescribing a heat/cold regimen or a joint protection regimen; performing joint surgery; 
prescribing a weight control regimen; and combinations of the foregoing. 

42. A method for detecting or preventing osteoarthritis in a subject, which comprises: 

detecting the presence or absence of one or more polymorphic variations associated with 
osteoarthritis in a nucleic acid sample from a subject, wherein the polymorphic variation is detected in a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-12; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-12; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-12; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

administering an osteoarthritis prevention or detection procedure to a subject in need thereof 
based upon the presence or absence of the one or more polymorphic variations in the nucleic acid sample. 

43. The method of claim 42, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 4, 7, 10 or 13. 

44. The method of claim 42, wherein the osteoarthritis prevention is selected from the group 
consisting of administering a corticosteroid, a nonsteroidal anti-inflammatory drug (NSAID), a 
cyclooxygenase-2 (COX-2) inhibitor, an antibody, a glucocorticoid, hyaluronic acid, chondrotin sulfate, 
glucosamine or acetaminophen; prescribing a heat/cold regimen or a joint protection regimen; performing 
joint surgery; prescribing a weight control regimen; and combinations of the foregoing. 
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45. A method of targeting infonnation for preventing or treating osteoarthritis to a subject in 
need thereof, which comprises: 

detecting the presence or absence of one or more polymorphic variations associated with 
osteoarthritis in a nucleic acid sample from a subject, wherein the polymorphic variation is detected in a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-12; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-12; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-12; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

directing information for preventing or treating osteoarthritis to a subject in need thereof based 
upon the presence or absence of the one or more polymorphic variations in the nucleic acid sample. 

46. The method of claim 45, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 4, 7, 10 or 13. 

47. A composition comprising a cell from a subject having osteoarthritis or at risk of 
osteoarthritis and an antibody that specifically binds to a protein, polypeptide or peptide encoded by a 
nucleotide sequence identical to or 90% or more identical to a nucleotide sequence in SEQ ID NO: 1-12. 

48. A composition comprising a cell from a subject having osteoarthritis or at risk of 
osteoarthritis and a RNA, DNA, PNA or ribozyme molecule comprising a nucleotide sequence identical 
to or 90% or more identical to a portion of a nucleotide sequence in SEQ ID NO: 1-12. 

49. The composition of claim 48, wherein the RNA molecule is a short inhibitory RNA 
molecule. 
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Abstract of the Disclosure 

Provided herein are methods for identifying a risk of osteoarthritis in a subject, reagents and kits 
for carrying out the methods, methods for identifying candidate therapeutics for treating osteoarthritis, 
and therapeutic and preventative methods applicable to osteoarthritis. These embodiments are based 
upon an analysis of polymorphic variations in nucleotide sequences within the human genome. 
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